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PREFACE 


The following pages contain the papers which were presented and 
discussed in the formal technical and tutorial sessions of the 1964 Spring 
Computer Conference. The Conference theme, “Computers ’64; Problem- 
Solving in a Changing World,” is intended to suggest the significance of 
this year as the mid-point between the infancy of electronic digital com- 
putation (the first elements of ENIAC began operating in June 1944) and 
the Orwellian year 1984, which symbolizes to some critics an unhappy 
potential which might be achieved if we do not wisely guide our rapidly 
advancing technology. Society is increasingly concerned with the broad 
adjustments that must take place if man is to gain the maximum long-term 
advantage from the computer. Reflections of this concern and of the 
increasing uses to which computers are being applied are, among others, 
the papers dealing with social implications and the tutorial papers on hy- 
brid systems, compilers, and information retrieval. 

This volume, a product of the 25th in the series of Joint Computer 
Conferences, for the first time includes an 'index. Thanks for this useful 
addition are due to the authors and session chairmen, who were most -coop- 
erative in getting “final copy” submitted on schedule, and to our publisher, 
who assumed the burden of preparing the index. Appreciation must also 
be expressed for the contributions of many other persons, who participated 
as conference planners, panel members, and reviewers. A special acknowl- 
edgement is made to the members of the Technical Program Committee, 
who willingly assumed the heavy burden of assembling the formal program. 


Herbert R. Roller 
General Chairman 

1964 Spring Joint Computer Conference 
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1. Introduction. Twenty years ago, in 1943, 
there were no Electronic computers. Ten years 
ago, in 1953, a large number of Electronic cal- 
culators were in use, but the general purpose 
stored program electronic computer was still 
quite rare. The Coincident Current Magnetic 
Core memory which finally provided both re- 
liability and speed at reasonable cost had only 
just been developed, and was still a laboratory 
device. A number of specially designed, mostly 
one of a kind, computers were in operation at 
Universities and government research centers. 
Commercially, a few Univac I computers had 
been delivered and were operating with great 
reliability at rather low speed. A few IBM 701's 
provided high speed but with very poor reliabil- 
ity. In 1953 most computing was being done by 
the Card-Programmed Calculator, an ingenious 
mating of an Electromechanical Accounting 
Machine with an Electronic Calculating Punch. 

Between 1954 and 1958 many hundreds of 
Electronic computers were installed. This was 
the era of the Vacuum Tube Computer, with 
Magnetic Drum storage on lower priced ma- 
chines, and Magnetic Core storage on the larger 
more expensive ones. By 1959 the first transis- 
torized computers had been delivered, and the 
production of vacuum tube computers ceased 
almost immediately. Low cost magnetic core 
memories made the magnetic drum almost ob- 
solete except as auxiliary storage. Since 1959 
thousands of computers have been delivered and 


installed, including many hundreds of very 
large computers. Electronic Computing and 
Data-processing have become part of the every- 
day industrial and commercial environment, and 
Electronic Computer manufacturing and pro- 
gramming has become a multi-billion dollar in- 
dustry. 

Because of this rapid, almost explosive pat- 
tern of growi;h most system^s, both in the hard- 
ware and software area could not be adequately 
planned, and it was often not possible to make 
adequate evaluations of old systems in time to 
use such evaluations in the design of new ones. 

2. Developments up to 1957. The first pro- 
gramming systems were systems of subroutines. 
Subroutines were in use on the pre-electronic 
Mark I, a large electromechanical computer 
built as a joint effort by IBM and Harvard Uni- 
versity in the early forties.^ The EDSAC at the 
University of Manchester was probably the first 
stored program Electronic Computer in oper- 
ation (1949) . The classic text on programming 
the EDSAC, by Wilkes, Wheeler and Gill 2 
makes the subroutine library the basis of pro- 
gramming, and stresses the automatic relocation 
feature of the EDSAC, which makes such 
libraries easy to use. 

The IBM Card-Programmed Calculator, de- 
veloped between 1947 and 1949, was not very 
fast by modern standards, but was an extremely 
versatile device. Operation codes were deter- 
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mined by the wiring of control boards. In- 
genuity could be expended on designing boards 
to optimize performance on particular problems 
that taxed the capacity of the computer, or it 
could be expended on the design of general pur- 
pose boards for use in large classes of problems. 
A set of general purpose boards represented a 
language for the CPC, and a number of lan- 
guages were tried and used.^ Most Scientific 
installations finally settled on a wiring that 
made the CPC appear to be a floating point ma- 
chine with three-address logic, and with a 
standard vocabulary of built-in routines like 
Square Root, Sine, Exponential, etc. Experi- 
ence with the CPC systems had many influences 
on the programming systems that were designed 
for the stored-program computers that fol- 
lowed. 

One of the most important of the early auto- 
matic programming groups was associated with 
the Whirlwind project at MIT. Whirlwind, 
which was built between 1947 and 1951, was a 
fast but very basic computer. With only a 16 
bit word it had a very limited instruction code, 
and very limited high speed storage. Even rela- 
tively simple calculations required the use of 
multi-precision techniques. The very difficulty 
of using the machine in its own language pro- 
vided great incentive toward the development 
of programming languages. The Summer Ses- 
sion Computer at MIT was one of the early 
interpretive systems, designed to make the 
Whirlwind computer available to students at a 
summer course in computing at MIT. These 
early developments led to the design of a quite 
elaborate '‘Comprehensive System” for Whirl- 
wind.4 At the associated Lincoln Laboratories 
the groundwork was laid for the very large pro- 
gramming systems that were developed in con- 
nection with Sage and other military projects.^ 

The first large scale electronic computer 
available commercially was the Univac I 
(1951. The first Automatic Programming 
group associated with a commercial computer 
effort was the group set up by Dr. Grace Hopper 
in what was then the Eckert-Mauchly Computer 
Corp., and which later became the Univac Divi- 
sion of Sperry Rand. The Univac had been de- 
signed so as to be relatively easy to program in 
its own code. It was a decimal, alphanumeric 


machine, with mnemonic instructions that were 
easy to remember and use. The 12 character 
word made scaling of many fixed-point calcula- 
tions fairly easy. It was not always easy to see 
the advantage of assembly systems and com- 
pilers that were often slow and clumsy on a 
machine with only 12,000 characters of high 
speed storage (200 microseconds average ac- 
cess time per 12 character word). In spite of 
occasional setbacks, Dr. Hopper persevered in 
her belief that programming should and would 
be done in problem-oriented languages. Her 
group embarked on the development of a whole 
series of languages, of which the most used was 
probably A2, a compiler that provided a three 
address floating point system by compiling calls 
on floating point subroutines stored in main 
memory.®''^ The Algebraic Translator ATS 
(Math-Matic^) contributed a number of ideas 
to Algol and other compiler efforts, but its own 
usefulness was very much limited by the fact 
that Univac had become obsolete as a scientific 
computer before ATS was finished. The BO 
(Flow-Matic^’^) compiler was one of the major 
influences on the COBOL language development 
which will be discussed at greater length later. 
The first sort generators'^ were produced by the 
Univac programming group in 1951. They also 
produced what was probably the first large 
scale symbol manipulation program, a program 
that performed differentiation of formulas sub- 
mitted to it in symbolic form.^^ 

Another and quite independent group at Uni- 
vac concerned itself with an area that would 
now be called computer-oriented compilers. 
Anatol Holt and William Turanski developed a 
compiler and a concept that they called GP 
(Generalized Programming ^2)^ Their system 
assumed the existence of a very general sub- 
routine library. All programs would be written 
as if they were to be library programs, and the 
library and system would grow together. A 
program was assembled at compile time by the 
selection and modification of particular library 
programs and parts of library programs. The 
program as written by the programmer would 
provide parameters and specifications according 
to which the very general library programs 
would be made specific to a particular problem* 
Subroutines in the library were organized in 
hierarchies, in which subroutines at one level 
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could call on others at the next level. Specifica- 
tions and parameters could be passed from one 
level to the next. 

The system was extended and elaborated in 
the GPX system that they developed for Univac 
II. They were one of the early groups to give 
serious attention to some difficult problems rela- 
tive to the structure of programs, in particular 
the problem of segmenting of programs, and 
the related problem of storage allocation. 

Perhaps the most important contribution of 
this group was the emphasis that they placed on 
the programming system rather than on the 
programming language. In their terms, the 
machine that the programmer uses is not the 
hardware machine, but rather an extended ma- 
chine consisting of the hardware enhanced by 
a programming system that performs all kinds 
of service and library maintenance functions in 
addition to the translation and running of pro- 
grams. 

The IBM 701 (1952) was the first commer- 
cially marketed large scale binary computer. 
The best known programming language on the 
701 was Speedcode,^^ !^ a language that made 
the one address, binary, fixed point 701 appear 
to be a three address decimal floating point ma- 
chine with index registers. More than almost 
any other language, Speedcode demonstrated 
the extent to which users were willing to sacri- 
fice computer speed for- the sake of program- 
ming convenience. 

The PACT^^-i^ system on the 701 set a prec- 
edent as the first programming system designed 
by a committee of computer users. It also set a 
precedent for a situation which unfortunately 
has been quite common in the computer field. 
The computer for which the programming 
system was developed was already obsolete be- 
fore the programming system itself was com- 
pleted. PACT ideas had a great deal of 
influence on the later developments that took 
place under the auspices of the SHARE organ- 
ization. 

Delivery of the smaller, medium scale mag- 
netic-drum computers started in 1953, and by 
1955-56 they were a very important factor in 
the computer field. The IBM 650 was by far the 
most popular of the early drum computers. The 


650 was quite easy to program in its own lan- 
guage, and was programmed that way in many 
applications, especially in the data-processing 
area. As a scientific computer it lacked floating 
point hardware, a feature that was later made 
available. A number of interpretive floating 
point systems were developed, of which the most 
popular was the one designed at the Bell Tele- 
phone Laboratories.^'^ This was a three address 
floating point system with automatic looping 
and with built in Mathematical subroutines. It 
was a logical continuation of the line of systems 
that had started with the general purpose CPC 
boards, and had been continued in 701 Speed- 
code. It proved that on the right kind of com- 
puter an interpretive system can provide an 
efficient effective tool. Interpretive systems fell 
into disrepute for a number of years. They are 
making a very strong comeback at the present 
time in connection with a number of so-called 
micro-programmed computers that have re- 
cently appeared on the market. 

The 650 permitted optimization of programs 
by means of proper placement of instructions on 
the drum. Optimization was a very tedious job 
for the programmer, but could produce a very 
considerable improvement in program running 
time. A program called SOAP, a Symbolic 
Optimizer and Assembly Programs, combined 
the features of symbolic assembly and auto- 
matic optimization. There is some' doubt as to 
whether a symbolic assembly system would have 
received very general acceptance on the 650 at 
the time SOAP was introduced. The optimiza- 
tion feature was obviously valuable. Symbolic 
assembly by itself on a decimal machine without 
magnetic tape did not present advantages that 
v/ere nearly as obvious. 

The major competitor to the 650 among the 
early magnetic drum computers was the Data- 
tron 205, which eventually became a Burroughs 
product. It featured a 4000 word magnetic 
drum storage plus a small 80 word fast access 
memory in high speed loops on the drum. Effi- 
cient programs had to make very frequent use 
of block transfer instructions, moving both data 
and programs to high speed storage. A number 
of interpretive and assembly system were built 
to provide programmed floating point instruc- 
tions and some measure of automatic use of the 
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high speed loops. The eventual introduction of 
floating point hardware removed one of the 
principal advantages of most of these systems. 
The Datatron was the first commercial system 
to provide an index register and automatic re- 
location of subroutines, features provided by 
programming systems on other computers. For 
these reasons among others the use of machine 
code programming persisted through most of 
the productive lifetime of the Datatron 205. 

One of the first Datatron computers was in- 
stalled at Purdue University. One of the first 
Algebraic compilers was designed for the Data- 
tron by a group at Purdue headed by Dr. A1 
Perlis. This is another example of a compiler 
effort based on a firm belief that programming 
should be done in problem-oriented languages, 
even if the computer immediately available may 
not lend itself too well to the use of such lan- 
guages. A big problem in the early Datatron 
systems was the complete lack of alphanumeric 
input. The computer would recognize pairs of 
numbers as representing characters for print- 
ing on the flexowriter, but there was no way to 
produce the same pair of numbers by a single 
key stroke on any input preparation device. 
Until the much later development of new input- 
output devices, the input to the Purdue compiler 
was prepared by manually transcribing the 
source language into pairs of numbers. 

When Dr. Perlis moved to Carnegie Tech the 
some compiler was written for the 650, and was 
named IT.^^ IT made use of the alphanumeric 
card input of the 650, and translated from a 
simplified algebraic language into SOAP lan- 
guage as output. IT and languages derived 
from it became quite popular on the 650, and on 
other computers, and have had great influence 
on the later development of programming lan- 
guages. A language Fortransit provided trans- 
lation from a subset of Fortran into IT, whence 
a program would be translated into SOAP, and 
after two or more passes through SOAP it 
would finally emerge as a machine language 
program. The language would probably have 
been more popular if its translation were not 
such an involved and time-consuming process. 
Eventually other, more direct translators were 
built that avoided many of the intermediate 
passes. 


The 701 used a rather unreliable electrostatic 
tube storage system. When Magnetic core stor- 
age became available there was some talk about 
a 701M computer that would be an advanced 
701 with core storage. The idea of a 701M was 
soon dropped in favor of a completely new com- 
puter, the 704. The 704 was going to incorpo- 
rate into hardware many of the features for 
which programming systems had been devel- 
oped in the past. Automatic fioating point hard- 
ware and index registers would make interpre- 
tive systems like Speedcode unnecessary. 

Along with the development of the 704 hard- 
ware IBM set up a project headed by John 
Backus to develop a suitable compiler for the 
new computer. After the expenditure of about 
25 man years of effort they produced the first 
Fortran compiler.^^ ^® Fortran is in many ways 
the most important and most impressive devel- 
opment in the early history of automatic pro- 
gramming. 

Like most of the early hardware and soft- 
ware systems, Fortran was late in delivery, and 
didn’t really work when it was delivered. At 
first people thought it would never be done. 
Then when it was in field test, with many bugs, 
and with some of the most important parts un- 
finished, many thought it would never work. It 
gradually got to the point where a program in 
Fortran had a reasonable expectancy of com- 
piling all the way through and maybe even of 
running. This gradual change of status from 
an experiment to a working system was true of 
most compilers. It is stressed here in the case 
of Fortran only because Fortran is now almost 
taken for granted, as if it were built into the 
computer hardware. 

In 'the early days of automatic programming, 
the most important criterion on which a com- 
piler was judged was the efficiency of the object 
code. ''You only compile once, you run the 
object program many times,” was a statement 
often quoted to justify a compiler design phi- 
losophy that permitted the compiler to take as 
long as necessary, within reason, to produce 
good object code. The Fortran compiler on the 
704 applied a number of difficult and ingenious 
techniques in an attempt to produce object cod- 
ing that would be as good as that produced by 
a good programmer programming in machine 




PROGRAMMING SYSTEMS AND LANGUAGES 


5 


code. For many types of programs the coding 
produced is very good. Of course there are some 
for which it is not so good. In order to make 
effective use of index registers a very compli- 
cated index register assignment algorithm was 
used that involved a complete analysis of the 
flow of the program and a simulation of the 
running of the program using information ob- 
tained from frequency statements and from 
the flow analysis. This was very time consum- 
ing, especially on the relatively small initial 704 
configuration. Part of the index register opti- 
mization fell into disuse quite early but much 
of it was carried along into Fortran II and is 
still in use on the 704/9/90. In many programs 
it still contributes to the production of better 
code than can be achieved on the new Fortran 
IV compiler. 

Experience led to a gradual change of phi- 
losophy with respect to compilers. During de- 
bugging, compiling is done over and over again. 
One of the major reasons for using a problem 
oriented language is to make it easy to modify 
programs frequently on the basis of experience 
gained in running the programs. In many cases 
the total compile time used by a project is much 
greater than the total time spent running object 
codes. More recent compilers on many comput- 
ers have emphasized compiling time rather than 
run time efficiency. Some may have gone too far 
in that direction. 

It was the development of Fortran II that 
made it possible to use Fortran for large prob- 
lems without using excessive compiling time. 
Fortran II permitted a program to be broken 
down into subprograms which could be tested 
and debugged separately. With Fortran II in 
full operation, the use of Fortran spread very 
rapidly. Many 704 installations started to use 
nothing but Fortran. A revolution was taking 
place in the scientific computing field, but many 
of the spokesmen for the computer field were 
unaware of it. A number of major projects 
that were at crucial points in their development 
in 1957-1959 might have proceeded quite differ- 
ently if there was more general awareness of 
the extent to which the use of Fortran had been 
accepted in many major 704 installations. 
Among these are the Algol project and the SOS 
project which are discussed below. 


3. Algol and its Dialects. Until quite re- 
cently, large scale computers have been mainly 
an American phenomenon. Smaller computers 
were almost worldwide right from the begin- 
ning. An active computer organization GAMM 
had been set up in Europe, and in 1957 a num- 
ber of members of this organization were ac- 
tively interested in the design of Algebraic 
compilers for a number of machines. They de- 
cided to try to reach agreement on a common 
language for various machines, and made con- 
siderable progress toward the design of such a 
language. There are many obvious advantages 
to having generally accepted computer inde- 
pendent problem oriented languages. It was 
clear that a really international effort in this 
direction could only be achieved with United 
States participation. The President of GAMM 
wrote a letter to John Carr who was then Presi- 
dent of the ACM, suggesting that representa- 
tives of ACM and of GAMM meet together for 
the purpose of specifying an international lan- 
guage for the description of computing pro- 
cedures. 

The ACM up to that time had served as a 
forum for the presentation of ideas in all aspects 
of the computer field. It had never engaged in 
actual design of languages or systems. 

In response to the letter from GAMM, Dr. 
Carr appointed Dr. Perlis as chairman of a 
committee on programming languages. The 
committee set out to specify an Algebraic com- 
piler language that would represent the Ameri- 
can proposal at a meeting with representatives 
of GAMM at which an attempt would be made 
to reach agreement on an internationally ac- 
cepted language. The ACM committee consisted 
of representatives of the major computer manu- 
facturers, and representatives of several Uni- 
versities and research agencies that had done 
work in the compiler field. Probably the most 
active member of the committee was John 
Backus of IBM. He was probably the only mem- 
ber of the committee whose position permitted 
him to spend full time on the language design 
project, and a good part of the '‘American Pro- 
posar^ was based on his work. 

The ACM committee had a number of meet- 
ings without any very great sense of urgency. 
Subcommittees worked on various parts of the 
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language and reported back to the full commit- 
tee, and in general there was little argument or 
disagreement. There is after all very general 
agreement about the really basic elements of 
an Algebraic language. Much of the language 
is determined by the desirability of remaining 
as close as possible to Mathematical notation. 
This is tempered by experience in the use of 
computers and in the design of compilers which 
indicates some compromises between the de- 
mands of desirable notation and those of prac- 
tical implementation. 

At one meeting of the committee Dr. Bauer, 
one of the leaders of the GAMM effort, presented 
a report on the proposed European language. 
Among other things they proposed that English 
language key words, like begin, end, for, do, be 
used as a world-wide standard. Of course this 
is something the American committee would 
never have proposed, but it seemed quite rea- 
sonable to go along with the Europeans in this 
matter. Although some of the notations seemed 
strange, there were very few basic disagree- 
ments between what GAMM was proposing, and 
what the ACM committee was developing. Dr. 
Bauer remarked that the GAMM organization 
felt somewhat like the Russians who were meet- 
ing with constant rebuffs in an effort to set up a 
summit meeting. With such wide areas of 
agreement why couldn’t the ACM-GAMM meet- 
ing take place? 

Although there is quite general agreement 
about the basic elements of an Algebraic lan- 
guage, there is quite considerable disagreement 
about how far such a language should go, and 
about how some of the more advanced and more 
difficult concepts should be specified in the lan- 
guage. Manipulation of strings of symbols, 
direct handling of vectors, matrices, and mul- 
tiple precision quantities, ways to specify seg- 
mentation of problems, and the allocation and 
sharing of storage; these were some of the 
topics which could lead to long and lively dis- 
cussion. The ACM language committee decided 
that it was unreasonable to expect to reach an 
agreement on an international language em- 
bodying features of this kind at that time. It 
was decided to set up two subcommittees. One 
would deal with the specification of a language 
which included those features on which it was 
reasonable to expect a wide range of agreement. 


The other was to work toward the future, to- 
ward the specification of a language that would 
really represent the most advanced thinking in 
the computer field. 

The short-range committee was to set up a 
meeting in Europe with representatives of 
GAMM. Volunteers for work on this committee 
would have to arrange for the trip to Europe 
and back, and were therefore limited to those 
who worked for an organization that would be 
willing to sponsor such a trip. The ACM was 
asked to underwrite the trip for Dr. Perlis. 

The meeting of the ACM and GAMM sub- 
committees was held in Zurich in the spring of 
1958, and the result was a Preliminary report 
on an International Algebraic Language, which 
has since become popularly known as Algol 58.2^ 

With the use of Fortran already well estab- 
lished in 1958, one may wonder why the Ameri- 
can committee did not recommend that the 
international language be an extension of, or at 
least in some sense compatible with Fortran. 
There were a number of reasons. The most ob- 
vious has to do with the nature and the limita- 
tions of the Fortran language itself. A few 
features of the Fortran language are clumsy 
because of the very limited experience with 
compiler languages that existed when Fortran 
was designed. Most of Fortran’s most serious 
limitations occur because Fortran was not de- 
signed to provide a completely computer inde- 
pendent language ; it was designed as a compiler 
language for the 704. The handling of a number 
of statement types, in particular the Do and If 
statements, reflects the hardware constraints of 
the 704, and the design philosophy which kept 
these statements simple and therefore restricted 
in order to simplify optimization of object 
coding. 

Another and perhaps more important reason 
for the fact that the ACM committee almost 
ignored the existence of Fortran has to do with 
the predominant position of IBM in the large 
scale computer field in 1957-1958 when the 
Algol development started. Much more so than 
now there were no serious competitors. In the 
data processing field the Univac II was much 
too late to give any serious competition to the 
IBM 705. RCA’s Bizmac never really had a 
chance, and Honeywell’s Datamatic 1000, with 
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its 3 inch wide tapes, had only very few special- 
ized customers. In the Scientific field there were 
those who felt that the Univac 1103/1103a/1105 
series was as good or better than the IBM 701/ 
704/709. Uni vac's record of late delivery and 
poor service and support seemed calculated to 
discourage sales to the extent that the 704 had 
the field almost completely to itself. The first 
Algebraic compiler produced by the manufac- 
turer for the Univac Scientific computer, the 
1103a, was Unicode, a com^piler with many 
interesting features, that was not completed 
until after 1960, for computers that were al- 
ready obsolete. There were no other large scale 
scientific computers. There was a feeling on the 
part of a number of persons highly placed in the 
ACM that Fortran represented part of the IBM 
empire, and that any enhancement of the status 
of Fortran by accepting it as the basis of an 
international standard would also enhance 
IBM's monopoly in the large scale scientific 
computer field. 

The year 1958 in which the first Algol report 
was published, also marked the emergence of 
large scale high speed transistorized computers, 
competitive in price and superior in perform- 
ance to the vacuum tube computers in general 
use. At the time I was in charge of Program.- 
ming systems for the new model 2000 computers 
that Philco was preparing to market. An Alge- 
braic compiler was an absolute necessity, and 
there was never really any serious doubt that 
the language had to be Fortran.^^ 22 a ^he very 
first sales contracts for the 2000 specified that 
the computer had to be equipped with a com- 
piler that would accept 704 Fortran source 
decks essentially without change. Other manu- 
facturers, Honeywell, Control Data, Bendix, 
faced with the same problems, came to the same 
conclusion. Without any formal recognition, in 
spite of the attitude of the professional commit- 
tees, Fortran became the standard scientific 
computing language. Incidentally, the emer- 
gence of Fortran as a standard helped rather 
than hindered the development of a competitive 
situation in the scientific computer field. 

To go on with the Algol development, the 
years 1958-1959 were years in which many new 
computers were introduced. The time was ripe 
for experimentation in new languages. As men- 
tioned earlier there are many elements in com- 


mon in all Algebraic languages, and everyone 
who introduced a new language in those years 
called it Algol, or a dialect of Algol. The initial 
result of this first attempt at the standardiza- 
tion of Algebraic languages was the prolifera- 
tion of such languages in great variety. 

A very bright young programmer at Bur- 
roughs had some ideas about writing a very fast 
one pass compiler for Burroughs new 220 com- 
puter. The compiler has come to be known as 
Balgol. 

A compiler called ALGO was written for the 
Bendix G15 computer. At Systems Development 
Corporation, programming systems had to be 
developed for a large command and control 
system based on the IBM military computer 
( ANFSQ32) . The resulting Algebraic language 
with fairly elaborate data description facilities 
was JOVIAL23 (Jules Schwartz' own Version 
of the International Algebraic Language). By 
now compilers for JOVIAL ha-ve been written 
for the IBM 7090, the Control Data 1604, the 
Philco 2000, the Burroughs D825, and for sev- 
eral versions of IBM military computers. 

The Naval Electronics Laboratory at San 
Diego was getting a new Sperry Rand Com- 
puter, the Countess. With a variety of other 
computers installed and expected they stressed 
the description of a compiler in its own lan- 
guage to make it easy, among other things, to 
produce a compiler on one computer using a 
compiler on another. They also stressed very 
fast compiling times, at the expense of object 
code running times, if necessary. The language 
was called Neliac,24-25 a dialect of Algol. Com- 
pilers for Neliac are available on at least as 
great a variety of computers as for JOVIAL. 

The University of Michigan developed a com- 
piler for a language called Mad, the Michigan 
Algorithmic Decoder. 2^.27 They were quite un- 
happy at the slow compiling times of Fortran, 
especially in connection with short problems 
typical of student use of a computer at a Uni- 
versity. Mad was originally programmed for 
the 704 and has been adapted for the 7090. It 
too was based on the 1958 version of Algol. 

All of these languages derived from Algol 58 
are well established, in spite of the fact that 
the ACM GAMM committee continued its work 
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and issued its now well known report defining 
Algol 60.2® 

Algol 60, known simply as Algol, went con- 
siderably further than was anticipated in some 
of the early committee meetings. The language 
did not limit itself to those areas in which there 
exists almost universal agreement. Concepts 
like recursive subroutines, dynamic storage 
allocation, block structure, own variables and 
arrays, were introduced which require the in- 
clusion of rather complex structures in the run- 
ning programs produced by the compiler. With- 
out attempting any serious evaluation of these 
concepts here, I think it is fair to say that they 
are difficult, and their inclusion in an Algebraic 
language that is intended to be universal is con- 
troversial. They led to much debate about the 
difficult areas and tended to obscpre some of the 
more fundamental accomplishments of the 
Algol committee. Algol set an important prece- 
dent in language definition by presenting a 
rigorous definition of its syntax in the Backus 
normal form.^^ As compared to Fortran it con- 
tains a much more general treatment of itera- 
tive loops. It provides a good systematic han- 
dling of Boolean expressions and variables and 
of conditional statements. The most serious 
deficiency in Algol results from its complete 
lack of input-output specifications. The han- 
dling of input-output is one of the most impor- 
tant services provided by a compiler, and a 
general purpose Algebraic compiler language 
is not completely specified until its input-output 
language has been defined. 

Algol compilers have been written for many 
different computers, but with the exception 
of Burroughs no computer manufacturer has 
pushed it very strongly. It is very popular 
among University and mathematically oriented 
computer people especially in Europe. For some 
time in the United States it will probably re- 
main in its status as another available computer 
language. 

4. Data Processing Compilers, The largest 
user by far of data-processing equipment is the 
United States government. The government, by 
law and by design, avoids giving preferential 
treatment to any one computer manufacturer. 
More than any other computer user, the govern- 
ment is plagued by the problems caused by the 


lack of compatibility between different kinds of 
computing equipment, whether manufactured 
by the same or by different suppliers. 

In the spring of 1959,the office of the Secre- 
tary of Defense summoned representatives of 
the major manufacturers and users of data- 
processing equipment to a meeting in Washing- 
ton, to discuss the problem associated with the 
lack of standard programming languages in the 
data processing area. This was the start of 
the Committee on Data Systems Languages 
(CODASYL), that went on to produce COBOL, 
the common business oriented language. From 
the beginning their effort was marked by mis- 
sionary zeal for the cause of English language 
coding. 

Actually, there had been very little previous 
experience with Data processing compilers. 
Univac's B-0 or Flow-Matic,®-^ which was run- 
ning in 1956, was probably the first true Data- 
Processing compiler. It introduced the idea of 
file descriptions, consisting of detailed record 
and item descriptions, separate from the de- 
scription of program procedures. It also intro- 
duced the idea of using the English language as 
a programming language. 

It is remarkable to note that the Univac I on 
which Flow-Matic was implemented did not 
have the data-processing capabilities of a good 
sized 1401 installation. To add to the problems 
caused by the inadequacy of the computer, the 
implementation of the compiler was poor, and 
compilation was very very slow. There were 
installations that tried it and dropped it. Others 
used it, with the philosophy that even with 
compiling times measured in hours the total 
output of the installation was greater using the 
compiler than without it. Experience with 
Flow-Matic was almost the only experience 
available on Data Processing compilers prior to 
the launching of the COBOL project. 

A group at IBM had been working for some 
time on the Commercial Translator, and some 
early experience on that system was also avail- 
able. 

At the original Defense Department meeting 
there were two points of view. One group felt 
that the need was so urgent that it was neces- 
sary to work within the state of the art as it 
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then existed and to specify a common language 
on that basis as soon as possible. The other 
group felt that a better understanding of the 
problems of Data-Processing programming was 
needed before a standard language could be pro- 
posed. They suggested that a longer range ap- 
proach looking toward the specification of a 
language in the course of two or three years 
might produce better results. As a result two 
committees were set up, a short range commit- 
tee, and an intermediate range committee. The 
original charter of the short range committee 
was to examine existing techniques and lan- 
guages, and to report back to CODASYL with 
recommendations as to how these could be used 
to produce an acceptable language. The com- 
mittee set to work with a great sense of ur- 
gency. A number of companies represented had 
commitments to produce Data-processing com- 
pilers, and representatives of some of these be- 
came part of the driving force behind the com- 
mittee effort. The short range committee de- 
cided that the only way it could satisfy its 
obligations was to start immediately on the 
design of a new language. The committee be- 
came known as the COBOL committee, and their 
language was COBOL. 

Preliminary specifications for the new lan- 
guage were released by the end of 1959, and 
several companies, Sylvania, RCA, and Univac 
started almost immediately on implementation 
on the MOBIDIC, 501, and Univac II respec- 
tively. 

There then occurred the famous battle of the 
committees. The intermediate range committee 
had been meeting occasionally, and on one of 
these occasions they evaluated the early COBOL 
specifications and found them wanting. The pre- 
liminary specifications for Honey well's FACT^® 
compiler had become available, and the inter- 
mediate range committee indicated their feeling 
that Fact would be a better basis for a Common 
Business Oriented Language than Cobol. 

The COBOL committee had no intention of 
letting their work up to that time go to waste. 
With some interesting rhetoric about the course 
of history having made it impossible to con- 
sider any other action, and with the support of 
the Codasyl executive board, they affirmed Cobol 
as the Cobol. Of course it needed improvements. 


but the basic structure would remain. The 
charter of the Cobol committee was revised to 
eliminate any reference to short term goals and 
its effort has continued at an almost unbeliev- 
able rate from that time to the present. Com- 
puter manufacturers assigned programming 
Systems people to the committee, essentially on 
a full time basis. Cobol 60, the first official de- 
scription of the language, was followed by 61^^ 
and more recently by 61 extended.^^ 

Some manufacturers dragged their feet with 
respect to Cobol implementation. Cobol was an 
incomplete and developing language, and some 
manufacturers, especially Honeywell and IBM, 
were implementing quite sophisticated data 
processing compilers of their own which would 
become obsolete if Cobol were really to achieve 
its goal. In 1960 the United States government 
put the full weight of its prestige and purchas- 
ing power behind Cobol, and all resistance dis- 
appeared. This was accomplished by a simple 
announcement that the United States govern- 
ment would not purchase or lease computing 
equipment from any manufacturer unless a 
Cobol language compiler was available, or un- 
less the manufacturer could prove that the per- 
formance of his equipment would not be en- 
hanced by the availability of such a compiler. 
No such proof was ever attempted for large 
scale electronic computers. 

To evaluate Cobol in this short talk is out of the 
question. A number of quite good Cobol com- 
pilers have been written. The one on the 7090 
with which I have had some experience may be 
typical. It implements only a fraction, less than 
half I would guess, of the language described in 
the manual for Cobol 61 extended. No an- 
nouncement has been made as to whether or 
when the rest, some of which has only been pub- 
lished very recently, will be implemented. What 
is there is well done, and does many useful 
things, but the remaining features are impor- 
tant, as are some that have not yet been put into 
the manual and which may appear in Cobol 63. 

The language is rather clumsy to use; for 
example, long words like synchronized and 
computational must be written out all too fre- 
quently ; but many programmers are willing to 
put up with this clumsiness because, within its 
range of applicability the compiler performs 
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many important functions that would otherwise 
have to be spelled out in great detail. It is hard 
to believe that this is the last, or even very close 
to the last word in data processing languages. 
Before leaving Data Processing compilers I 
wish to say a few words about the development 
of the FACT compiler. 

In 1958 Honeywell, after almost leaving the 
computer business because of the failure of their 
Datamatic 1000 computer, decided to make an 
all out effort to capture part of the medium 
priced computer market with their Honeywell 
800 computer. The computer itself is very inter- 
esting but that is part of another talk. 

They started a trend, now well established, of 
contracting out their programming systems de- 
velopment, contracting with Computer Usage 
Co. for a Fortran language compiler. 

Most interesting from our point of view was 
their effort in the Data Processing field. On the 
basis of a contract with Honeywell, the Com- 
puter Sciences Corporation was organized. 
Their contract called for the design and produc- 
tion of a Data processing compiler they called 

FACT.30-33 

Fact combined the ideas of data processing 
generators as developed by Univac, GE Han- 
ford, Surge and 9PAC with the concepts of 
English language data processing compilers that 
had been developed in connection with Univac's 
Flow-Matic and IBM^s commercial translator. 

The result was a very powerful and very in- 
teresting compiler. When completed it con- 
tained over 250,000 three address instructions. 
Designed to work on configurations as small as 
4096 words of storage and 4 tape units it was 
not as fast as some more recent compilers on 
larger machines. 

The Fact design went far beyond the original 
COBOL specifications,^® and has had consider- 
able influence on the later COBOL development. 

Like all other manufacturers Honeywell has 
decided to go along with the COBOL language, 
and Fact will probably fall into disuse. 

5. Assemblers and Operating Systems, Sym- 
bolic assembly language has become an almost 
universal form for addressing a computer in a 
computer oriented language. 


After the first 704’s were delivered in 1956 
a number of users produced assembly routines 
for use on the computer. One of the early stand- 
ardization efforts involved a choice of a stand- 
ard assembly program to be used by Share, the 
704 users group. It is a sign of some of the 
thinking that was current then that the stand- 
ard chosen was UASAP.^® The first SAP was 
a very basic assembly system. It did practically 
nothing but one-to-one translation, and left the 
programmer in complete control of all of the 
parameters of the program. In the early days 
many users felt that this was all an assembly 
system should do. Some still feel that way, but 
on most computers the simple assembly system 
has been replaced by the full scale computer 
oriented compiler in which one-to-one code 
translation is augmented by extensive libraries 
of subroutines and generators and by allocation, 
segmentation, and other program-organization 
features.^"^ 

The word-macro-instruction apparently was 
coined in connection with the symbolic assembly 
systems that were developed for IBM's 702/705 
computers. These Autocoder^® systems with 
their large macro-instruction libraries have 
been used for huge amounts of data processing 
programming on a number of machines. 

Assembly systems gradually grew into or be- 
came incorporated into operating systems.^® '*® 
Perhaps the earliest monitor system on the 704 
was put into operation at the General Motors 
Research center.^^'^^ ^phe idea of automatic se- 
quencing of batches of jobs spread rapidly until 
it was almost universally used in connection 
with large computers. It made it possible for 
large computers to handle small jobs with rea- 
sonable efficiency and greatly extended their 
range of application. The idea of such systems 
was to run the computer without any stops, and 
to relegate the operator to occasional mount- 
ing of tapes, and otherwise to responding to 
very simple requests presented to him on the 
on-line printer. Under such a system debugging 
becomes a completely off-line process. The only 
response to trouble in a program is to dump and 
get on with the next program. 

At the end of 1956 IBM announced its new 
709 computer. The 709 was essentially a 704 
with internally buffered input and output. 



PROGRAMMING SYSTEMS AND LANGUAGES 


11 


As mentioned earlier, IBM was at its peak of 
penetration of the large scale scientific com- 
puter market at that time, and the rest of the 
industry watched with great interest as many 
of the best programming systems people repre- 
senting many leading scientific computer instal- 
lations met as a Share committee to design the 
very complex programming system which was 
eventually called SOS (Share Operating Sys- 
tem). 

The design of programming systems by large 
committees representing many companies and 
institutions has almost invariably led to dis- 
appointing results. SOS was no exception. 
Planned mainly by personnel of West Coast air- 
craft and research companies, it was to be writ- 
ten according to their specifications by the IBM 
programming systems activity on the East 
Coast. Separation of design and implementa- 
tion responsibility by 3000 miles is almost 
enough in itself to guarantee great difficulty, if 
not complete failure. In 1958 the chairman of 
the Share 709 system committee wrote, ‘The 
fundamental procedures used throughout the 
system will undoubtedly be retained in every 
installation. This has not been the case. The 
SOS system is now in use at only a very few in- 
stallations. There are many reasons, of which I 
would like to suggest just a few. SOS put all of 
its emphasis on the computer oriented program- 
ming system. The time during which SOS was 
being designed and implemented was the time 
during which the attitude toward Fortran was 
changing from polite skepticism to very general 
acceptance. By the time SOS was ir nearly full 
operation some installations were using almost 
nothing but Fortran. Apparently little or no 
effort had been expended on the problem of 
compatibility between SOS and Fortran. It was 
only in 1962 that an SOS system which handles 
Fortran was distributed by the Rand Corpora- 
tion.44 Their system accepts Fortran source 
programs, and produces the binary symbolic or 
squoze decks that can be combined with other 
programs produced by the SOS system. IBM 
boasted of over 50 man years of effort on the 
SOS system for the 709. They spent almost no 
effort on Fortran for the 709, on the theory that 
Fortran was developed for the 704 would be 
adequate. The Fortran II system that was 
originally distributed for the 709 took no ad- 


vantage of the fact that the 709 hardware per- 
mitted buffered input and output. The SOS 
system provided a very elaborate buffering 
system. 

SOS proposed to provide a system in which 
the programmer would need to know and use 
only one language, the compiler source lan- 
guage. One of its major achievements was the 
provision of source language modification of 
programs at load time without full recompila- 
tion. A very versatile debugging system was 
built around this feature. While this and other 
features of the system are extremely attractive, 
there is a serious question as to whether they 
are worth the price paid in system complexity, 
and in increased loading time. I think it is 
interesting to point out that a relatively simple 
assembly system, FAP, and a very basic oper- 
ating system, the Fortran Monitor System, both 
originated at customer installations and not by 
the manufacturer, became the most widely used 
systems on the 709/90 computers. Quite similar 
systems were introduced on competitive equip- 
ment, the Philco 2000 and the CDC 1604. Com- 
plexity and system rigidity no doubt contrib- 
uted to the fact that SOS was not generally ac- 
cepted. It will be interesting to follow the his- 
tory of a new and very complicated system, the 
IB SYS/IB JOB complex that has recently been 
introduced by IBM on the 7090 and related ma- 
chines. A critique of these systems is far be- 
yond the scope of this discussion. A few com- 
ments may be in order. IB JOB presents a very 
elaborate assembly system MAP, and transla- 
tors from two languages, FORTRAN IV and 
Cobol into Map. They are then translated into 
a language that is close to machine language, 
with the final steps of the translation left to a 
very complicated loader. T'le design, which 
calls for the translation of problem oriented 
source languages into an intermediate computer 
oriented source language is very attractive. By 
having the assembly system do most of the work 
of the compiler it is possible to have many of 
the features of the problem oriented language 
available by means of subroutine calls to those 
who prefer to write in assembly language. This 
design philosophy is attractive, but I think that 
it is wrong. Attractiveness and elegance should 
not be the determining design criteria for pro- 
duction compiling systems. Design of a system 
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is a compromise between many design criteria. 
One of the most important is keeping the sys- 
tem overhead cost low on the many problems 
that do not require the use of very sophisticated 
system features. The code produced by a com- 
piler like Fortran is straightforward simple 
code. The assembler for such code can be sim- 
ple and straightforward. The loading program 
can be designed to permit easy combination with 
programs produced by other systems. An as- 
sembler designed to aid in the production of 
large programming systems contains many fea- 
tures that are seldom used except in the coding 
of such systems. A wasteful mismatch may 
occur when the output of Fortran is fed through 
such an assembler. 

Not so very many years ago there was quite 
a bit of discussion as to whether general pur- 
pose operating systems should be designed and 
supplied by the manufacturers. Some users felt 
that the very great difference in the job mix 
and operating philosophy at the various instal- 
lations called for specially designed and tailored 
system programs. For a time the argument 
seem to be settled by the almost universal as- 
sumption that operating systems, and computer 
software in general were as much an obligation 
of the manufacturer as was the building of the 
computers themselves. I wonder if this assump- 
tion will be able to stand up in face of the rapid 
developments in the large computer field that 
will lead to computing systems that are very 
much more diverse, and very much more com- 
plex than those that are in general use today. 

In the large computer field multiprocessing 
and multiprogramming systems will soon be- 
come the rule rather than the exception. Many 
experiments in these directions are being tried 
with computers of the generation that is now 
coming to an end. Systems combining a 7094, 
a 1301 Disc unit and a 7040 will soon be com- 
monplace. There are a number of military sys- 
tems involving several computers and much 
peripheral equipment all working together un- 
der a common operating system. 

Among newer computers already delivered to 
customers there are several models that have 
been designed to make it possible and practical 
to run peripheral equipment on-line while simul- 
taneously carrying out independent computer 


processing tasks. The distinction between off- 
line and on-line tends to disappear on such 
systems, and the operating systems must be able 
to control equipment in many different modes. 
Systems already delivered that have some fea- 
tures that permit multiprogramming and multi- 
processing include the Honey^well 800, The Fer- 
ranti Atlas, The Burroughs 5000 and D825. 
There is some very interesting recent literature 
about programming systems for these com- 
puters.^5’4®’^'^ 

In the next generation of large computers it 
may be possible to implement true demand proc- 
essing systems. Demand systems have been 
advocated by many in opposition to batching 
systems. In a demand system problems are sub- 
mitted as they arise. The system controls the 
input of jobs and the scheduling of jobs by 
stacking jobs in queues according to length, 
priority, etc. A demand system requires multi- 
programming facilities, but also requires much 
more elaborate decision making on the part of 
an executive system than is present in most 
monitors today. 

The complexity required in some of these op- 
erating systems may seem to require that they 
be uniform systems designed and produced by 
the manufacturer. But, another feature that is 
being stressed more and more is modularity, 
which permits an almost unlimited variety in 
system configurations. It is very difficult to de- 
sign a single operating system that is appropri- 
ate for a computing system based on Disc stor- 
age, and also for one based on tapes or drums, 
and also for any combination of auxiliary de- 
vices. The problem will get more complicated 
when high speed storage at different levels is 
available in various quantities. It is quite rea- 
sonable to anticipate a system in the next few 
years that will have a very high speed film mem- 
ory, backed up by a fast core memory, backed 
up by a large and somewhat slower core mem- 
ory, backed up by high speed drums, then discs 
and tapes. It will be a real challenge to design 
programming systems that are valid for all 
combinations in such systems. 

In the early days one of the aims of the oper- 
ating system was to get the human being out of 
the system as much as possible. In a multi-pro- 
gramming system it is possible to allow human 
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intervention in the running of a program with- 
out any appreciable loss of computer time, since 
the computer will presumably have other pro- 
grams it can work on. There has been a great 
deal of publicity given to the experiments in the 
use of on-line consoles on present day systems.^^ 
Such consoles may be a routine part of many 
computing systems in a few years. 

In recent issues of a number of Computer 
publications there is an advertisement for a 
computer that claims to be faster than the 7090 
and costs only $215,000 dollars. Whether or not 
the claim is true, it does serve to emphasize the 
fact that the cost of computer processing capa- 
bility is going to go down rapidly. It is going 
to be economically feasible to put together ex- 
tremely large, varied, and complicated concen- 
trations of computer components. Program- 
ming systems are going to increase in number 
and complexity, and the job of the system pro- 
gram designer is going to remain 'as it always 
has been, very difficult, but very, very inter- 
esting. 

6. Bibliography. This is not intended to be a 
complete bibliography of the field of Program- 
ming Systems and Languages. It is rather a 
selected list of publications that may help to 
document the text. A great deal of reference 
material is contained in manuals published by 
computer manufacturers. It would serve no 
useful purpose to list such manuals here. Man- 
uals exist for just about every programming 
system mentioned in the text, and the mention 
of a language or system may be interpreted as 
an implied reference to the manual. I have at- 
tempted to include specific references to sources 
other than manuals for most systems discussed, 
but in many cases the system manuals provide 
better and more complete documentation. 
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BOUNDED CONTEXT TRANSLATION 
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All translators are syntax-directed in the 
sense that the translator must obviously recog- 
nize the various syntactic structures and the 
output of the translator is a function of the 
syntax of the language. The term syntax-di- 
rected is usually applied to a translator which 
contains a direct encoding of the syntax of the 
language, this direct encoding being used by 
the translator as data. The companion paper 
by Cheatham and Sattley is concerned with this 
type of translation.! In the other class of 
translators the syntax is essentially buried in 
the coding of the translator. Most of the 
algebraic languages in use are precedence 
grammars,^* ^ or close enough so that the prop- 
erties of precedence grammars are useful. 
Using the properties of precedence grammars, 
bounded context translation is possible. At each 
step in the scan of an expression in bounded 
context translation the decision as to what 
action to take next is a function of the symbol 
currently under scan and of N symbols on 
either side (where N is fixed for the particular 
language) . 

Most translators produce an intermediate 
form of the program before producing the final 
machine code. Once this intermediate form is 
produced, the distinction between syntax-di- 
rected translation and other methods disap- 
pears. The primary goal of the intermediate 
form is to encode the program in a form which 
is easily and efficiently processed by the com- 
puter. Most optimization algorithms, such as 


elimination of common subexpressions and 
optimum evaluation of Boolean expressions,^ 
are much simpler when applied to some inter- 
mediate form rather than to the original ex- 
pression or the final machine language version. 
The intermediate form exhibits the structure 
(i.e., which subexpressions are the operands of 
each operator) and the order of evaluation of 
the subexpressions. 

In this paper we will restrict ourselves to 
the source language defined in Appendix A. 
For the sake of brevity when we say expression 
we will mean either an expression, as defined 
in Appendix A, or an assignment statement. 
This language has no constants and identifiers 
are single letters, thus avoiding the problem 
of recognition. A translator for a language 
such as ALGOL would have a recognizer which 
would scan a statement and which, each time 
it was called, would recognize the next element 
in the input string. After recognizing an iden- 
tifier the recognizer would store it in a symbol 
table if it were not already there. The symbol 
table would also contain other information per- 
taining to the identifier such as type, address, 
dimension, etc. The recognizer would then re- 
turn a pointer to the place in the symbol table 
where that identifier and its properties were 
stored. A constant, after recognition, would be 
stored in a table of constants, given an internal 
identifier, and treated from that point on just 
as any other identifier would be. When an 
operator was recognized, a pointer to the place 
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in the operator table where the properties of 
that operator were stored would be returned. 
Thus we see that the recognizer would effec- 
tively reduce an expression of a language such 
as ALGOL to an expression of our language; 
that is, each element would be reduced to a 
single symbol. In general, the intermediate 
form does not contain the identifiers or opera- 
tors but the pointers supplied by the recognizer. 
However, for clarity, in the examples we will 
use the actual identifiers and operators. The 
recognizer is heavily dependent upon the details 
of the particular language being translated and 
such characteristics of the computer as char- 
acter representation, word size, etc. The algo- 
rithms to be discussed in this paper depend only 
upon a few very general properties of the source 
language. Most of the common compiler lan- 
guages have these properties so the techniques 
discussed here are essentially language inde- 
pendent, While the algorithms which we will 
describe here apply only to the simple expres- 
sions of Appendix A, several of them may be 
trivially generalized to translate more complex 
expressions and even the bulk of the ALGOL 
statements.'^'®*^^ 


Some representation of the tree form of an 
expression is used in several translators. Fig- 
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Figure 1. 




Ross's tree form 
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Figure 2. 


ure 1 shows the tree form of the expression 
A:=B*C-hD*(E — F) and the matrix repre- 
sentation of the tree® * The first column is 
the operator, the second column is the left 
hand operand, and the third column is the 
right-hand operand. An integer represents the 
result of the row with that number. The sub- 
expressions are to be evaluated in the sequence 
in which the rows are written. Figure 2 shows 
the tree form used by Ross.i^ The first three 
columns in the representation are the same as 
in Figure 1. There are two additional columns 
which explicitly state the sequence of sub- 
expression evaluation. The fourth column is 
the minor evaluation string pointer (dashed 
arrows in the tree) and the fifth column is the 
major evaluation string ppinter (solid arrows 
in the tree). In the example the evaluation is 
to start with row 2. The rules for following the 
evaluation strings for the row under examina- 
tion are : 

1. If this is the first time this row has been 
examined and, 

a. If the minor string pointer is not 
empty; proceed to the row named by 
it. 

b. If the minor string pointer is empty; 
evaluate this row and proceed to the 
row named by the major string 
pointer. 

2. If this is the second time this row has 
been examined, evaluate this row and 
proceed to the row named by the major 
string pointer. 

In this example the ‘(’and*)’ symbols should 
be treated as “do nothing’’ operators when their 
evaluation is called for by the rules. Ross’s 
representation is part of a more general system 
and, hence, some features of the representation 
are not pertinent to this discussion. A repre- 
sentation similar to Ross’s based on threaded 
lists is described in Ref. 9. 

Evans^ bases his intermediate form on postfix 
notation. The expression A:=B*C-f-D*(E — F) 
in postfix form is ABC*DEF— *+ :=. In this 
form both the structure and the order of sub- 
expression evaluation are implicit. The right- 
hand operand of an operator is the first com- 
plete expression to the left of the operator and 
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the left-hand operand is the second complete 
expression to the left of the operator. In the 
example the right-hand operand of the is 
'DEF — and the left hand operand is 'BC*’ 
The subexpressions are evaluated in left to 
right order. 

The transformation of a completely paren- 
thesized expression into matrix form is rela- 
tively simple. The expression is scanned from 
left to right. The following rules are applied 
to each symbol as it is encountered (a variable 
name is considered as a single symbol) : 

1. If the symbol is not a continue the 
scan. 

2. If the symbol is a O' and the expression 
is properly formed then the four symbols 
to the left of the 0 ' should be of the form 

‘sl#s2', where ‘si' and ‘s2’ stand for 
variable names or integers (row num- 
bers) and is any operator; write 
si s2' as the next row of the matrix 
and replace ‘(sl#2)’ in the expression 
by the number of the row just written in 
the matrix. Continue the scan. 

In Figure 3 the changes in the expression are 
shown on the left, the small arrow under the 
expression indicating the points, during the 
scan, at which a row is written in the matrix. 
The row actually written at that point is shown 
on the right. 

Completely parenthesized expressions have 
such an overabundance of parentheses that they 
are difficult to read; hence, languages such as 
ALGOL have precedence rules making it un- 
necessary to write completely parenthesized ex- 
pressions. The language is a precedence gram- 
mar if the precedence rules are such that given 
two adjacent operators it is unambiguous which 
is to be evaluated first. Thus if a language is 
a precedence grammar it is possible to construct 
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Figure 4. 

a table of the type shown in Figure 4. To deter- 
mine which of two adjacent operators to evalu- 
ate first, find the intersection of the row labeled 
with the left operator and the column labeled 
with the right operator. In the context of 
transforming an expression into matrix form, 
the order of evaluation of the operators is to 
be interpreted as the order of their entry into 
the matrix. A subexpression enclosed by paren- 
theses is to be completely evaluated before any 
consideration is given to the operators on either 
side of it. Applying these considerations we 
have the following rules for transforming an 
expression to •matrix form. We enclose the 
expression on the left by ‘| — ' and on the right 
by ‘ — r. Again we scan from left to right, ap- 
plying the rules to each symbol in turn : 

1. If the sym^bol is an operator, #1, and the 
left context is, 

a. ‘I — si' ; continue the scan. 

b. ‘ (si' ; continue the scan. 

c. ‘s2#2sl'; look up #2#1 in the table. 
If the table entry is, 

i. ‘right' ; continue the scan. 

ii. ‘left'; write ‘#2 s2 si' as the next 
row of the matrix, replace ‘s2#2sl' in 
the expression by the number of the 
row just written in the matrix, and 
apply rule 1 again. 

2. If the symbol is ‘)' and the left context 
is, 

a. ‘s2#sl'; write ‘# s2 si' as the next 
row of the matrix, replace ‘s2#sl' in 
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the expression by the number of the 
row just written in the matrix, and 
apply rule 2 again. 

b. ‘{s’; replace ‘(s)’ by ‘s’ and continue 
the scan. 

3. If the symbol is ‘— |’ and the left context 
is, 

a. ‘s2#sl!; write ‘# s2 si’ as the next 
row of the matrix, replace ‘s2#sl’ in 
the expression by the number of the 
row just written in the matrix, and 
apply rule 3 again. 

b. ‘I — s’; the expression has been com- 
pletely transformed into matrix form. 

This is the essence of bounded context transla- 
tion. The rules just stated show that N=3 for 
the precedence grammar of appendix A. That 
is, in deciding what action to take next, at most 
only the three symbols immediately to the left 
of the symbol currently under scan need be 
examined regardless of the length of the ex- 
pression being translated. 

Before examining some ways that bounded 
context analysis has actually been implemented 
let us restate, precisely, the above rules in the 
form of a flow chart (Figure 5). In the flow 
charts of this paper the true exit of a decision 
box is marked t and the false exit is marked f. 
We consider the expression being analyzed as 
a string of symbols, S (indexed by k), bounded 
on the left by ‘| — ’ and on the right by ‘ — |’. 
We will distinguish three classes of symbols: 

1. I, the class of identifiers: any variable 
name. 

2. R, the class of matrix row numbers : any 
integer. 

3. d, the class of operator symbols: 6 = 

{+. — , *, /,:=,(,), I— — |}. For fu- 
ture use we distinguish the subclass of 
arithmetic operators, e° = — , *, /}. 

Symbols from the input string are transferred 
to an auxiliary list, L (indexed by j). This will 
avoid the problem of gaps when replacements 
are made in the input string. M is the matrix 
(with i the row index) and T(x, y) is a func- 
tion whose arguments are a pair of operators 
and whose value is the label found at the inter- 
section of the x-row and the y-column in Figure 


6. The label ERR has been filled in for all illegal 
operator pairs. If one of these occurs then the 
expression is incorrectly formed. The questions 
of the detection of incorrectly formed expres- 
sions and what action to take when errors are 
detected is very important in actual translator 
construction. These questions will not, how- 
ever, be discussed in this paper. We will assume 
here that all expressions are properly formed. 
When the function T(x, y) takes on the value 
END then the matrix form of the expression 
is complete. 



Figure 6. 
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It is obvious from the table in Figure 6 that 
most operator pairs cause one of two actions 
to take place: 1) the operator currently under 
scan is put on the list, or 2) rows are written 
in the matrix until an operator pair is found 
which calls for some other action. The only 
exceptions are the removal of parentheses and 
termination of the transformation. If a preced- 
ence function is defined for each operator then 
a table of all operator pairs is not necessary. 
A precedence function, P(x), is defined such 
that, given a pair of operators #1#2, if 
P(#1)#^#P(#2) then the operator #1 is 
evaluated before the operator #2. A precedence 
function for the operators used in the previous 
examples is defined in Figure 7. The algorithm 
for transforming an expression into matrix 
form that is used in GAT^ and MAD®''^ uses 
a precedence function. The flow chart in 
Figure 8 gives an algorithm for generating the 
matrix form of the expression. This algorithm 
differs from the GAT-MAD algorithm only in 
the direction of scanning the expression. No- 
tice that, assuming all expressions are properly 
formed and since P(‘G) < P(‘)') and PG | — ') 
< PG — I’), when the test P(Lj — 1)) ^ 
P(S(k))fails then S(k)=‘)’ implies L(j — 1) = 

and S(k) = ‘ — |’ implies L(j — 1) = ‘| — 

Bauer and Samelson^o use two lists, one, 
L, in the translator and one, N, in the object 
program. L, called the “symbols cellar,’’ is used 
for storing operators which can not be evalu- 
ated yet, just as in the previous algorithms. N, 
called the “numbers cellar,” is used by the 
object program during execution for the tem- 
porary storage of partial results and values of 
identifiers. Their algorithm does not generate 
an intermediate form, but immediately gen- 
erates machine code* In the examples of this 
paper, the machine language used will be that 
described in Appendix B. This language is very 
similar to the FAP symbolic machine language 
used on the IBM 709-90-94 computers. C (in- 
dexed by m) will be a matrix where generated 
machine instructions are stored. The first col- 
umn will contain the operation code and the 
second column the address. Figure 9 is the 
table, for Bauer and Samelson’s algorithm, cor- 
responding to the table in Figure 6, and Figure 
10 is their algorithm. Whenever an identifier 


is encountered in the scan, instructions are 
generated to move its value onto the N list. 
Bauer and Samelson give, in the same paper, 
modifications to this algorithm which will gen- 
erate more efficient machine code. 
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Figure 10. 

Ross's algorithm, as given in Ref. 12, trans- 
forms more than just expressions into tree 
form. In the version of his algorithm given in 
Figures 11 and 12 the machinery not needed to 
transform expressions has been omitted. A 
minor evaluation string pointer is set whenever 
the right operand of an operator is set and 
both operands are variables, or whenever a left 
operand is set and the operator is a modifier. 
The only modifier in an expression is the '('. 
The minor evaluation string is forced to pass 
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Figure 12. 

through modifiers since a modifier may change 
the interpretation of the right operand. For 
example, the right operand of may be either 
a normal subexpression or the argument of a 
function depending upon whether the left argu- 
ment of the '('is empty or is an identifier. A 
major evaulation string pointer is set whenever 
a right or left operand is set and the operand 
is a row number. 
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Evans^ uses an algorithm based on Floyd's 
productions.!^ Instead of generating machine 
code directly as Floyd did in his paper, Evans 
transforms the expression into postfix form. 
This algorithm is probably the most versatile 
of the lagorithms which we have described here. 
The central idea here is a set of productions 
which determine the course of action at each 
step in the translation process. A production 
consists of five parts ; 

1. A top of stack configuration. 

2. A replacement for the configuration of 
part 1. 

3. The name of an action routine. 

4. Scan, no scan flag. 

5. Which production to consider next. 

Figure 14 gives the productions for transform- 
ing an expression into postfix form. The ex- 
pression is scanned from left to right. As each 
new character is scanned it is put on the top 
of the pushdown stack and the productions are 
then searched for the first one whose part 1 
matches the present top of the stack (when a 
class symbol such as 0 ^ appears, any member 
of that class will give a match) . When a match 
is found that portion ofi the top of the stack 
which matched part 1 of the production is re- 
placed by part 2 of the production. If part 2 
is empty, the replacement degenerates to the 
deletion of the symbols, which matched part 1, 
from the top of the stack. The action routine 
named in part 3 is then executed. After the 
action routine has been executed the produc- 
tions are again searched for a match with the 
top of the stack ; however, the search is started 
with the production whose line number is given 
in part 5 of the last interpreted production. If 
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a appears in part 4, a new symbol is scanned 
from the input string before the search con- 
tinues. The productions given in Figure 13 are 
sufficient only to transform the simple expres- 
sions of this paper. The productions that Evans 
gives will transform all the statements of 
ALGOL into postfix form. Figure 15 gives an 
algorithm for transforming an expression into 
postfix form using the productions of Figure 
13. The action routine OUT puts the last iden- 
tifier scanned into the output string, M. The 
action routine OUTP(x) puts its argument, x, 
into the output string. The productions cause 
the unary operator ‘loc’ to be introduced into 
the output string following a variable which is 
on the left .side of ‘ .which indicates that 
a location (where a value is to be stored) is 
involved rather than a value. The action rou- 
tine COMP uses the precedence function, de- 
fined in Figure 14, to determine when operators 
are placed in the output string (i.e., to deter- 
mine the sequence of evaluation of the oper- 
ators) . 

Once the matrix form of the expression has 
been generated, the final translation to sym- 
bolic machine code is relatively simple. Corre- 
sponding to each operator is a set of machine 
instructions, a pattern. Figure 16 gives the 
patterns for the operators of our language ; the 
first column is the operation code and the sec- 
ond column is the address. The matrix is trans- 
lated one row at a time, in sequence. Row i of 
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Pattern Table 


the matrix, M, is translated by making a copy, 
in the code output matrix, of the pattern corre- 
sponding to the operator M(i, 1), replacing all 
occurrences of *V by the left operand, M(i, 2), 
all occurrences of V’ by the right operand, 
M(i, 3), and all occurrences of ‘P by the row 
number, i. The row numbers (integers) which 
appear in the code are to be interpreted as the 
names of temporary storage locations. Figure 
17 is an algorithm for performing this trans- 
lation. N is the number of rows in the matrix, 
M, C is the code output matrix, PATl(x) a 
function whose value is the index of the first 
line of the pattern for the operator x, and 
PAT2(x) a function whose value is the index 
of the last line of the pattern for x (both these 
functions are defined in Figure 18). The trans- 
lation of the matrix in Figure 1 is shown in 
Figure 19. 

It is immediately obvious that very inefficient 
machine code is produced by this algorithm. 



Figure 16. 


Figure 17. 
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Matrix Machine code 


Figure 19. 

Once we begin to consider the production of 
efficient machine code, the algorithms rapidly 
get very complicated. We can make some im- 
provement, however, without a great deal of 
complication. In the example are several redun- 
dant store and fetch instructions. These can 
be eliminated if we keep track of the contents 
of the special machine registers. We then insert 
store instructions only when the current con- 
tents of a special register is not one of the 
operands of the next operator and fetch in- 
structions only when the operand is not already 
in a special register. To implement this we 
generalize the idea of patterns. Corresponding 
to each operator is a variable pattern, that is, 
the instructions which are actually copied into 


the code output matrix depend upon the con- 
tents of the special registers. 

The method used in the MAD translator is 
general enough for the machine structure as- 
sumed in this paper. The problem of gener- 
ating efficient machine code is a very difficult 
one and is yet unsolved. There are methods, 
undocumented, other than the one to be de- 
scribed but none which can claim to produce 
highly efficient code in all circumstances. The 
patterns will be arranged so that the result of 
a row is, in general, not stored, i.e., it will be 
left in one of the registers AC or MQ. The 
machine code produced when a row of the 
matrix is translated will depend on the values 
of four Boolean variables. These variables are 
named AC, MQ, LO, and RO. Suppose we are 
ready to translate the ith row of the matrix, 
then these variables have the following mean- 
ings: 

1. If AC is true, the result produced by row 
i-1 is in the AC. 

2. If MQ is true, the result produced by row 
i-1 is in the MQ. 

3. If LO is true, the left operand in row i is 
i-1 (i.e., the result of row i-1) . 



Figure 20. 
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4. If RO is true, the right operand in row i 
is i-1. 

Instead of regarding the patterns as a fixed set 
of machine code to be produced in translating 


a row of the matrix, we now take the view 
that the pattern is really a small program 
which, when executed, produces machine code. 
Viewing it in this light, we need a language in 
which to write the program. 
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In our language we will need the following 
types of instructions: produce a machine in- 
struction, branch on the truth or falsity of one 
of the Boolean variables, absolute transfer, set 
the value of one of the Boolean variables, and 
halt. Figure 20 is a flow chart for a program 
to produce the code for where *r and *r' 
have the same meanings as in Figure 16, but 
'V now refers to the temporary used to store 
the result of the previous row. Notice that if 
there is a result in the AC or MQ and it is not 
either of the operands then instructions to 
store it and fetch one of the operands are gen- 
erated. If one of the operands is in the wrong 
special register an exchange instruction is gen- 
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erated. The word COMPILE means, here, 
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A command in our pattern programming lan- 
guage will have a line number, an operation 
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Figure 23. 
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Figure 25. 

code, and two operand fields. The commands 
are: 

The program for translating the matrix into 
machine code now becomes an interpreter 
which executes (interpretively), for each row 
of the matrix the appropriate pattern program. 
The pattern programs will be stored in a 
matrix, P (indexed by k), the first column" is 
the operation code, the second column is the 
first operand, and the third column is the sec- 
ond operand. As before, we have a function 
PAT(x) whose value is the index of the first 


line of the pattern program for the operator x. 
The values of LO and RO are set by examining 
the operands in a row before executing the pat- 
tern program for that row. The function 
PAT(x) is defined by the table of Figure 21, 
Figure 22 gives the pattern programs (the P 
matrix), and Figure 23 is the interpreter for 
the pattern programs, i.e., the algorithm for 
translating the matrix into machine code. 
OP(y) is a function, defined in Figure 24, 
whose argument, y, is one of the operation 
codes of the pattern programming language and 
whose value is a label, the label of that portion 
of the algorithm which interprets the operation 
code y. Figure 25 shows the translation of the 
matrix in Figure 1 into machine code using the 
algorithm of Figure 23. For each row of the 
matrix is shown the machine code produced for 
that row and the status of the four Boolean 
variables after translating that row and just 
before considering the next irow, that is, at 
point 3 in the flow chart of Figure 23. Notice 
that just this simple consideration of the con- 
tents of the special registers gives us a saving 
of five instructions when compared to the in- 
structions produced by the algorithm of Figure 
17. 

It is obvious that only trivial modifications 
are necessary to be able to use the pattern pro- 
gram interpreter with Ross’s intermediate 
form. Instead of considering the rows of the 
matrix in sequence, the minor and major evalua- 
tion strings are followed. When the rules for 
following the evaluation strings call for evalua- 
tion of a row, the appropriate pattern program 
is interpreted. Evans uses an algorithm very 
similar to the pattern program one which we 
have described. 

No consideration has been given to the type 
(real, integer, etc.) of the identifiers involved. 
In many computers there are different instruc- 
tions for floating decimal (real) and integer 
arithmetic. This is easily taken care of by 
having, for example, two pattern programs for 
‘+’, one to be interpreted when the operands 
are real, and the other to be interpreted when 
the operands are integer. Finally, it is clear 
that the interpreter instead of generating ma- 
chine instructions could actually execute them, 
thus turning the entire translator itself into 
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an interpreter which could execute (interpre- 

tively) programs written in the source lan- 
guage. 
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APPENDIX A 

Definition of the Language Used in the 
Examples 

<identifier> : :=A |B|C|D!EiF|G|H 
<primary> : :=<identifier> | 

( <expression> ) 

<mop> ::z=* | / 

<aop>::=-|- | — 

<term> : :=<primary> | <term><mop> 

< primary > 

<expression> : :=<term> | <expression> 
<aop> <term> 

< assignment statement > : :=<identifier> 
<expression> 

APPENDIX B 

The Machine Language Used in the Examples 

There are two special registers, the AC and 
the MQ. Instructions are single address. The 
meaning of the instructions is expressed as a 
short ALGOL program. The ‘?’ in the context 
‘MQ := ?’ means that the contents of the MQ is 
indeterminate. When the name of a register 
does not appear to the left of an ‘:=’ in the de- 
scription of an instruction, then the value of 
that register is unchanged. 

Instruction Meaning 
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SYNTAX-DIRECTED COMPILING 

T. E. Cheatham, Jr., and Kirk Sattley 
Computer Associates, Inc. 

Lakeside Office Park 
Wakefield, Massachusetts 


INTRODUCTION 

This paper is primarily concerned with the 
analysis of source statements in a programming 
language, although some of the ideas and tech- 
niques may be applicable to the analysis of 
source statements in a natural language. We 
are particularly concerned with those tech- 
niques which might be classed as predictive; 
the companion paper by Graham' is concerned 
with other (“nonpredictive”) techniques of 
analysis. Very broadly the techniques we will 
discuss operate as follows : Given a set of rules 
(Syntax Specification) for forming allowable 
constructs, eventually resulting in a statement 
(or sentence, word, program, etc.) of a lan- 
guage, we analyze a source statement in that 
language by guessing, or predicting, how the 
statement is constructed and either verifying 
that this is the case or backing up to try again, 
assuming some other method of construction. 
We keep a “history” of our attempts and when 
we have determined the exact way in which the 
statement is constructed we can use this “his- 
tory” of its construction for further processing 
of the components of the statement. 

We will be concerned, secondarily, with the 
synthesis of machine coding, given an analysis 
of a source statement. We do not, however, dis- 
cuss in any detail the difficult (and, at this 
iwint, not terribly well understood) problems 
of s 3 Tithesizing highly efficient coding. Refer- 
ence £1] contains a brief discussion of this 
problem. 


We are concerned hardly at all with the ex- 
tremely important and often neglected problems 
of the^environment in which a compiler or code 
resulting. from a compiler is to operate. Refer- 
ence [S] sketches our position in this matter. 

The phrase “syntax-directed” in the title 
refers to the method by which the compiler is 
given the syntactic specification of the language 
it is to compile. That is, rather than having the 
syntactic structure of the language reflected in 
the actual encoding of the compiler algorithm, 
a “syntax-directed” compiler contains (or uses, 
as parametric data) a relatively simple and 
direct encoding of the syntactic structure of 
the language, for example, as it might be ex- 
pressed in Backus Normal Form. By “simple 
and direct encoding,” we mean, for instance, 
numerical codes for the distinct syntactic types 
of the language, and direct pointers represent- 
ing the relations of concatenation and alterna- 
tive choice, plus perhaps some sorting. 

This paper is not intended as a review or 
critique of syntax-directed compilers or com- 
piler techniques nor have we presented a com- 
prehensive bibliography on the subject. Bather, 
our purpose is tutorial — ^to present in as 
straightforward a manner as possible the essen- 
tial ideas of syntax-directed compilers. Un- 
fortunately, there is, at the present time, no 
completely adequate review paper on the sub- 
ject; Floyd^® does include a rather complete 
bibliography. 
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Our presentation commences with a discus- 
sion of syntax and the syntactic specifications 
of languages — programming languages in par- 
ticular. We then discuss techniques for en- 
coding the syntax into tables and develop a 
simple algorithm, the ANALYZER, which can 
perform a syntactic analysis of source material, 
using this tabled syntax specification as data. 
From this we proceed to a discussion of the 
generation or synthesis of code from the results 
of the analysis. Finally, we discuss several 
problems and limitations. Certain problems of 
syntactic specification and some modifica- 
tions of the schemes we describe in the body of 
the paper have been discussed in an appendix. 

SPECIFICATION OF SYNTAX 

Several essentially equivalent formalisms for 
the representation of syntax have been devel- 
oped. These include such things as 

Post Production Systems, developed by the 
logician Emil Post during the 1940's as a tool 
in the study of Symbolic Logic ; 

Phrase Structure Grammars, developed by 
the linguist Noam Chomsky during the 1950’s 
as a tool in the study of natural languages ; and 

Backus Normal Form, developed by the pro- 
grammer John Backus during the late 1950’s 
as a tool in the description of programming 
languages. 

We shall use here a formalism most similar 
to.Backus’s. 

A syntactic specification of a language is a 
concise and compact representation of the struc- 
ture of that language, but it is merely that — 
a description of structure — and does not by 
itself constitute a set of rules either for produc- 
ing allowable strings in the language, or for 
recognizing whether or not a proffered string 
is, in fact, an allowable string. 

However, rules can be formulated to produce, 
or recognize, strings according to the specifica- 
tion. In a ‘‘syntax-directed’’ compiler it is an 
algorithm which performs the recognition of 
allowable input strings, and it does this by 
using (an encodement of) the Syntax Specifica- 
tion as data. In this paper, we shall call such 
an algorithm an (or the) “Analyzer.” 


In order to discuss the structure of the lan- 
guage,., we give names to classes of strings in 
the language — we call these names (or the 
classes they denote) “Syntactic Types.” Some 
of the classes of interest consist of single char- 
acters of the source alphabet: these we call 
“Terminal Types,” and specifically “Terminal 
Characters” ; to talk about any particular one, 
we will merely display the character. Most of 
the classes, though, are more complicated in 
structure and are defined in terms of other 
classes; these we call “Defined Types,” and to 
designate one, we choose a mnemonic name for 
the class and enclose it in the signs ‘<’ and ‘>’. 

Basic Syntax Specification 

Rather than proceed immediately to a dis- 
cussion of Backus Normal Form, we shall first 
define a simple form of Syntax Specification — 
the Basic Specification. This consists of a set 
of “Simple Type Definitions” (meaning, not 
that the Types are simple, but the Definitions 
are). A Simple Type Definition consists of the 
name of a Defined Type, followed by the curious 
sign ‘::=’ followed by a sequence of Syntactic 
Types, Defined or Terminal. An example, taken 
from the Syntax I — soon to be discussed — 
would be : 

<assignment> : :=<variable>==<arithexpr> 

The Defined Type on the left of the ‘::==’ is 
called The Defined Type of the Definition ; and 
the Definition is said to be a Definition of its 
defined type. In general — even for the more 
complicated forms of Type Definitions yet to 
come — we shall call the right-hand side of the 
Definition the “Definiens.” Any sequence of 
type designators appearing in a Definiens is 
called a “Construction,” and each type designa- 
tor within the Construction is a “Component” 
of the Construction. So, the above example is a 
Definition of the Defined Type < assignment > ; 
its Definiens is a Construction with three com- 
ponents, which are, in the order of their ap- 
pearance, the Defined Type <variable>, the 
Terminal Charcater ‘m’ and the Defined Type 
<arithexpr>. 

A Simple Type Definition of this sort states 
that, among the strings of the source language 
belonging to the Defined Type, are those which 
are concatenations of substrings — as many 
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substrings as there are components of its (Sim- 
ple) Definiens — such that each substring (in 
order of concatenation) belongs to the Syntac- 
tic Type named by the corresponding Compo- 
nent (in order of appearance in the Definiens). 
Applied to our example : A source string belongs 
to the class <assignment> (or, for short, “is 
an <assignment>'') if it can be broken into 
three consecutive substrings, the first of which 
is a <variable>, the second of which is the 
single character and the third of which is 
an <arith expr>. 

If we were interested in using Syntax Specifi- 
cations as ‘“generative grammars'’ — ^that is, if 
we were writing an algorithm to use a Syntax 
Specification to produce samples of strings of 
the language, we would write something which, 
applied to our example, would have the effect 
of: “if you wish to produce an <assignment>, 
then: first choose any definition of <variable> 
and produce a string according to that defini- 
tion, then, second write down the character ‘=', 
then third produce an <arith expr> according 
to any definition of that type; then you have 
produced an <assignment>" 

Thus, the use of a (Basic) Syntax Specifica- 
tion as a generative grammar is quite straight- 
forward. The inverse problem — using the Syn- 
tax Specification as a “recognition grammar" — 
is, like many inverse problems, rather more 
involved. In our opinion, the fundamental idea 
— ^perhaps “germinal" would be a better word 
— ^which makes syntax-directed analysis by com- 
puter possible is that of goals : a Syntactic Type 
is construed as a goal for the Analyzer to 
achieve, and the Definiens of a Defined Type 
is construed as a recipe for achieving the goal 
of the Type it defines.* It is this use of goals 
which leads to another description of analysis 
techniques of this kind — “predictive analysis" : 
setting up the recognition of a particular Syn- 
tactic Type as a goal amounts to predicting that 
an instance of that type will be found. Need- 
less to say, this use of the term “goal" is not 
to be confused with the “goal-seeking behavior" 
of “artificial intelligence" programs or “self- 

* To our knowledge, the first person to formulate and 
implement this conception was E. T. Irons, in his initial 
design of the PSYCO compiler; his paper [4] describes 
the essentials of his compiling technique. 


organizing systems.” However, when we come 
to specifying the algorithm for selecting par- 
ticular goals in a particular order, we reach 
the point at which the several existing syntax- 
directed techniques diverge. Our purpose in 
this section on “Basic Syntax Specification” is 
to lay a foundation common to the principal 
different applications of the technique; hence, 
if we try to “picture” the use of a Syntax 
Specification as a recognition grammar, as we 
pictured its use as a generation grammar in 
the preceding paragraph, the most generally 
valid statement we can make is : 

We can say that we have recognized an oc- 
currence of a given Syntactic Type (at a given 
position in the source string) if one of the two 
following conditions obtains : 

1. The Syntactic Type is a Terminal Char- 
acter, and the character at the given posi- 
tion in the source string is exactly that 
Terminal Character ; 

2. The Syntactic Type is a Defined Type, and 
for some one of its (Simple) Definitions, 
we have already recognized concatenated 
occurrences of the Components of that 
Definiens, in the stated order, the first 
of which occurs at the given position. 

In 6rder for the set of Simple Type Defini- 
tions to constitute a useful Syntax Specifica- 
tion, it should satisfy some conditions. 

(Cl) Any Defined Ts^pe which occurs as a 
Component in any Definiens must also occur as 
the Defined Type of some definition. 

The desirability of this “completeness condi- 
tion” is fairly obvious — it will be very difficult 
to recognize a Defined Type if the Analyzer has 
no Definition of that Type. Of course, it is pos- 
sible that the Analyzer may never be asked to 
find an instance of this Type, but then all the 
Definitions which included it as a Component 
would also be superfluous. 

(C2) Every Defined Type must ultimately be 
constructible entirely out of Terminal Charac- 
ters. 

This “connectedness condition” is designed to 
prevent a cycle of definitions which it is im- 
possible to break out of — ^that is, if a Defined 
Type is defined only in terms of Defined Types, 
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each of which in turn is defined only in terms 
of Defined Types, etc. Of course, it will be true 
that there will be cycles within the act of defini- 
tions, and these cycles may be traversed arbi- 
trarily many times ; but there must exist some 
point in each cycle where an alternative defini- 
tion of one of the types exists. It is probably 
sufficient to restate condition (C2) in the fol- 
lowing fashion : 

A Terminal Type will be said to be “grounded.” 
A Defined Type is grounded if it has at least 
one Definition, all of whose Components are 
grounded ; then 

(C2') Every Defined Type must be grounded. 

(C3) There must exist exactly one Defined 
Type which does not appear as a Component in 
any Definiens (except possibly its own). This 
Type is called the “Starting Type” of the Syn- 
tax Specification. 

The Starting Type represents the “largest” 
construction allowable under the Specification 
— e.g., “sentence,” or perhaps “paragraph,” in 
natural language applications, or usually “pro- 
gram” in compiler applications. If there is no 
Starting Type, the Analyzer, quite literally, will 
not known where to begin. 

Let us note here in passing that there is a 
property of Syntax Specifications which is of 
great importance to theoreticians in this field, 
and to people who are designing new languages 
or trying to construct Specifications for exist- 
ing complex languages, but which is irrelevant 
to the problem of programming a syntax-di- 
rected Analyzer. This is the question of “struc- 
tural ambiguity” — does the Syntax Specifica- 
tion permit a particular source-language string 
to be correctly analyzed in two different fash- 
ions? A simple example, taken from natural 
language (with apologies to Oettinger) is: 
“Time flies incessantly.” This is certainly an 
English sentence — ^but is it a metaphorical dec- 
larative sentence, or a terse imperative? In 
the case of an Analyzer algorithm on a com- 
puter, only one thing is done at a time — if the 
Analyzer is asked to find an instance of an 
Ambiguous Sjmtactic Type, it must try one of 
the possible definitions first; if that definition 
succeeds, the Analyzer is satisfied, and the other 
definitions are not considered. This is not to 


say that an Analyzer, one of whose functions 
is to find all possible analyses, cannot be built ; 
this has been done by Oettinger^^ for natural 
language, and by Irons®, for use in compiling. 

Some Transformations of the Basic 
Specification 

We shall now proceed to build up to the de- 
scription of a particular simple Analyzer algo- 
rithm, and at this point, we must choose one 
among several different techniques. The differ- 
ences between the various techniques stem from 
the following considerations : 

— Given a Syntax Specification, there are 
different ways of using it to determine the next 
goal which the analyzer is to pursue. The two 
major approaches are called the “top-down” 
and the “bottom-up” techniques. 

— There are different ways to use the output 
of the Analyzer, e.g., interpretation, immediate 
generation of output code, recording of the 
analyzed structure for later generation, etc. 

The particular type of Analyzer we have 
chosen to describe here is, we believe, the easiest 
to explain, and is suitable for any of the three 
output-treatments mentioned above. It does not 
correspond, so far as we know, to any actually 
existing compiler system, although it bears a 
surprisingly strong resemblance to the algo- 
rithm used in some of the compilers that Com- 
puter Associates, Inc., has recently produced. 
(See Shapiro and Warshall)^ 

The first step is to transform our Basic Syn- 
tax Specification into a Backus Normal Form. 
The description of a Syntactic Type Definition 
is now expanded so that the Definiens, instead 
of simply a Construction (which, remember, 
was a sequence of Components, which, in turn 
were Syntactic Types) can now be a sequence 
of Constructions, separated by the special sign 
‘I'. Any such sequence of Constructions, sep- 
arated by *1' and appearing in a Definiens is 
called an “Alternation,” and the individual Con- 
structions in the sequence are called “Alterna- 
tives” of the Alternation. To transform a Basic 
Syntax Specification into Backus Normal Form, 
we must repeatedly apply the following trans- 
formation rule to the set of Definitions, until 
it can no longer be applied : 
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(Tl) If any Defined Type has more than one 
Definition in the set, delete all such Definitions, 
and add to the set a new Definition whose left- 
hand side is the Defined Type in question, and 
whose Definiens is an Alternation of the origi- 
nal (Basic) Definientes. 

As an example, the Basic Syntax Specifica- 
tion for the simple language we are using for 
illustration in this paper would have contained 
three definitions for < factor > : 

<factor> <variable> 

<factor> <integer> 

<factor> (<arith expr>) 

After applying (Tl) to the Basic Specification, 
these three Definitions would be replaced by the 
single Definition 

<factor> :::= <variable> | <integer> | 
(<arith expr>) 

This Definition, of course, should be read ‘"a 
source string is a <factor> if it is either sl 
<variable> or an <integer> or an <arith 
expr> enclosed in parentheses.’’ This Backus 
Normal Form is exactly the form of Syntax 
Specification used in the defining documents 
for ALGOL 60 [8], and Table 1 presents a 
complete syntax for a simple arithmetic pro- 
gramming language in this form, which we 
shall refer to as ‘‘Syntax I.” 

The Action of the Analyzer 

We can now sketch out the action of the 
Analyzer: At the beginning of the process, it 
takes the Starting Type of the Specification as 
its first goal. Then at any point in the process 
it follows these steps when it has a Defined 
Type as its current goal : 

The Analyzer consults the Definition of the 
Defined Type (in Backus Normal Form, of 
course, each Defined Type has a unique Defini- 
tion), and specifically, it considers the first 
Alternative in that Definition. It then succes- 
sively takes each Component of that Alternative 
as a sub-goal. (Of course, it must re-enter 
itself for each of these goals, and it must keep 
track of where it was at each level of re-entry.) 
If at any point it fails to find one of these sub- 
goals, it abandons that Alternative, and con- 
siders the next Alternative in that Definition, 


if there is one, and steps through the Compo- 
nents of that Alternative. If there is no next 
Alternative, it has failed to realize its current 
goal, and reports this fact “upstairs.” If it 
succeeds in finding the sub-goals corresponding 
to each of the Components of any Alternative 
in the Definition of its current goal, it has 
found its goal, and reports that fact. 

This rough sketch conveniently ignores a 
number of sticky points which we now have to 
consider. The first of these points is that we 
discussed the action of the Analyzer only when 
its current goal was a Defined Type. What if 
the goal is a Terminal Character? 

When it comes to writing a compiler in prac- 
tice, the question of recognizing Terminal Char- 
acters brings us face to face with the lovely 
problems of restricted character sets, input- 
output idiosjmcracies of the particular com- 
puter, etc. Both in practice and in the remain- 
der of this paper, we assume the presence of 
another routine, called the “Recognizer,” which 
the Analyzer can call upon to deal with these 
problems. So far, we have also glossed over the 
problem of keeping track of where in the Input 
String the Analyzer is looking. Obviously, when 
the first Component of some Construction has 
been recognized, starting at a certain point in 
the Input String, then, when the Analyzer pro- 
ceeds to look for the next Component, it must 
move its Input-String pointer past the substring 
which satisfied the first Component. Now, since 
a Type which has been successfully recognized 
consists, ultimately, of a sequence of Terminal 
Characters, and the recognition of Terminal 
Characters is the job of the Recognizer, we 
shall also leave the moving of the Input-String 
pointer to the Recognizer. The fundamental 
action of the Recognizer is then as follows : 

The Recognizer is called by the Analyzer, 
and asked if a specified Terminal Character 
occurs at a stated character position in the 
Input String. The Recognizer then negoti- 
ates with the I/O system of the computer 
(if necessary) and examines the character- 
position in the input string. If the input 
character at that position is not the Terminal 
Character the Analyzer asked for, the Recog- 
nizer reports failure. However, if the input 
character is the desired Terminal Character, 
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the Recognizer reports success to the Analy- 
zer, and advances the Input-string pointer by 
one character position. 

Having the Recognizer at hand, it turns out 
to be convenient in practice to give it some fur- 
ther responsibilities. Consider the definitions 
of <variable> and <integeT> in Syntax I. 
These amount to saying that an < integer > is 
any sequence of digits, and a < variable > is 
any sequence of letters or digits as long as the 
first one is a letter. If we relegate the recogni- 
tion of these fundamental types to the Recog- 
nizer, rather than the Analyzer, we obtain 
several advantages. 

— The Recognizer can be hand-tailored to per- 
form these particular recognitions very effi- 
ciently on the particular machine, and this 
speeds up the analysis considerably. 

— As far as the Analyzer is concerned, if the 
Syntax Specification calls for an <integer>, 
for instance, any old integer will do. But when 
we come to generating output code, well need 
to know the particular integer which occurred 
at that point. The Recognizer can perform the 
conversion from external number representa- 
tion to machine representation, and either re- 
turn the numerical value, or enter the number 
in a “Literal Table'' and return its index value. 
Similarly, when it recognizes <variable>, it 
can look in a “Symbol Table" for previous oc- 
currences of that particular variable, add it to 
the table if necessary, and return a line num- 
ber. 

— In practical applications the question of 
what constitutes a “blank" is often an involved 
one. In some languages, a long comment may 
function syntactically as a blank. When a com- 
piler runs under the control of some operating 
systems, certain segments of the Input string 
(e.g., identification fields in cards) must be 
treated as blanks, or ignored entirely. Since 
the Recognizer constitutes the interface be- 
tween the Analyzer and the outside world, it 
can take care of these matters. 

To allow for this extended Recognizer in our 
Syntax Specification, we allow another sort of 
Terminal Type (up to now, we recall, the only 
Terminal Types have been Terminal Charac- 
ters). We designate these new Terminal Types 


with script capital letters, and call them “Ter- 
minal Classes." Thus, in Syntax I, we can de- 
lete the definitions of <variable>, <integer>, 
<letter>, and <digit>, and replace the De- 
finientes of <variable> and <integer> by the 
Terminal Classes Q/ and (jy respectively. This 
produces Syntax II, Table 1, which is the one 
we shall refer to throughout the rest of this 
paper. 

But this technique could be carried further. 
A compiler-builder might decide that he prefers 
operator-precedence techniques for the analysis 
of arithmetic expressions, while keeping the 
flexibility of syntax-direction for analysis of the 
larger constructions. His arithmetic-expression 
scanner would then function as a Recognizer 
for the previous Defined Type ‘<arith expr>,' 
and, for this simple language, the Syntactic 
Specification would take the form of Syntax III, 
Table 1. 

To summarize: When the current goal is a 
Defined Type, the Analyzer calls upon itself 
to find it, but when the goal is a Terminal Type, 
it calls upon the Recognizer. When the Recog- 
nizer is called, it determines according to its 
own internal rules, whether the desired Termi- 
nal Type occurs in the Input string at the cur- 
rent pointer-position ; if not, it reports failure ; 
if so, it advances the pointer past the substring 
which constitutes the Terminal Type (single 
character, or member of a Terminal Class), and 
reports success. 

Encoding the Syntax Specification 

We are now almost ready to proceed to the 
encoding of the Syntax Specification for the use 
of the Analyzer, except for one embarrassing 
question : 

Consider, as an example, the definition of 
<term> in Syntax II: 

<term> : : = <factor> | <term> * <factor> 

What if the Analyzer should find itself con- 
sidering the second Alternative in this Defini- 
tion ? This would amount to the Analyzer 
saying to itself “in order to find my current 
goal, which is <term>, I must set out to find 
the first Component of this Alternative, which 
is <term>." In order to find a term it must 
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TABLE 1 

Alternative Syntax Specifications 

Syntax 1: 

<program> <assigninent> [ <assignnient> ; <program> 

<assignment> : := <variable> =: <arith expr> 

<arith expr> : <term> | <arith expr> + <tenn> 

<tenn> <factor> | <tenn> * <factor> 

<factor> <variable> | <integer> | (<arith expr>) 

<variable> <letter> | <variable> <letter> | <variable> <digit> 

<integer> : <digit> | <integer> <digit> 

<Ietter> A | B | C | D | E | F i G [ H | 1 1 J | K | L | M | N | 0 | P [ 

Q|R|S|T|U|V|WtX|Y|Z 
<digit> ::=:0|1|2|3|4|5|6|7|8|9 

Syntax II: 

<prograni> <assignnient> | < assignment > ; <program> 

<assignment> : := <variable> =: <arith expr> 

<arith expr> : := <term> | <arith expr> + <term> 

<term> <factor> | <term> * <factor> 

<factor> <variable> | <integer> | (<arith expr>) 

< variable > Q/ 

<integer> : := (} 

Syntax III: 

<program> <assignment> | <assignment> ; <program> 

<assignment> <variable> — <arith expr> 

<aritb expr> £ 

<variable> : := (j 


be able to find a term first. This is called the 
''left recursion problem/’ and it has led some 
language designers to disallow Type Definitions 
which include Alternatives which mention the 
Defined Type of the Definition as their first 
Component. To a human analyst, of course, the 
intent of the Definition is plain ; he should first 
look for a < factor > ; if he finds one, he has 
indeed found a <term>, but he should con- 
tinue looking to see if he can find a followed 
by another <factor> ; if he can, he has found 
a "longer” <term>, and should continue look- 
ing for a still longer one; as soon as he fails 
to find a following his latest <term>, he 
can stop looking, confident that he has found 
the longest <term> at that point in the string. 
This recognition process can be embodied in the 
encoding of the Syntax Specification, but it 
does require detecting the presence of these 


left-recursive alternatives, and giving them 
some special treatment. Keeping this in mind, 
we shall proceed to encode the Syntax Specifica- 
tion. 

The encoding consists of two tables, the Syn- 
tax Type Table and the Syntax Structure Table. 
The Syntax Type Table will contain an entry 
for each Syntactic Type which occurs anywhere 
in the Syntax Specification, whether it be a De- 
fined Type or a Terminal Type. Each entry i 
the Type Table consists of two items : a yes-no 
item TERMINAL, and an integer item LOOK- 
FOR. When line t in the Type Table corre- 
sponds to a Terminal Type, TERMINAL [t] 
will be set to "yes,” and LOOKFOR [t] will con- 
tain an arbitrary code number which the Recog- 
nizer will interpret as denoting the particular 
Terminal Character or Terminal Class it should 
try to recognize. When line t in the Type Table 
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TABLE 2 
The Syntax Tables 


(GOAL) 

(Type) (Index) 

— 

TERMINAL 

LOOK- 

FOR 

< program > 

i 

No 

1 

< assignment > 

ii 

No 

4 

<arith expr> 

iii 

No 

7 

<term> 

iv 

No 

10 

< factor > 

V 

No 

13 

< variable > 

vi 

No 

18 

< integer > 

vii 

No 

19 

0 / 

viii 

Yes 

101 

0 

ix 

Yes 

102 

9 

X 

Yes 

1 

= 

xi 

Yes 

2 

+ 

xii 

Yes 

3 

* 

xiii 

Yes 

4 

( 

xiv 

Yes 

5 

) 

XV 

Yes 

6 


2.1 

Syntax Type Table 


SOURCE 

(Index) 

TYPE- 

CODE 

STRUCT 

SUC- 

CESSOR 

ALTER- 

NATE 

Corresponds 

to 

Definition 

1 

ii 

Yes 

2 

FAIL 

1.1 

2 

X 

No 

3 

OK 


3 

i 

Yes 

OK 

FAIL 

1.2 

4 

vi 

No 

5 

FAIL 


5 

xi 

No 

6 

FAIL 


6 


Yes 

OK 

FAIL 

1.2 

7 


Yes 

8 

FAIL 

3.1 

8 


No 

9 

OK 


9 


Yes 

8 

FAIL 

3.2 

10 

V 

Yes 

11 

FAIL 

4.1 

11 

xiii 

No 

12 

OK 


12 

V 

Yes 

11 

FAIL 

4.2 

13 

vi 

Yes 

OK 

14 

5.1 

14 

vii 

Yes 

OK 

15 

5.2 

15 

xiv 

No 

16 

FAIL 


16 

iii 

No 

17 

FAIL 


17 

XV 

Yes 

OK 

FAIL 

5.3 

18 

viii 

Yes 

OK 

FAIL 

6.1 

19 

ix 

Yes 

OK 

FAIL 

7.1 
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corresponds to a Defined Type, TERMINAL [<] 
will be set to “No,” and LOOKFOR [t] will con- 
tain some line-number in the Syntax Structure 
Table, to be filled in presently. We keep in mind 
that we can now use small integers as, in effect, 
names of Syntactic Types, by using them to 
index the Syntax Type Table. 

The Syntax Structure Table will contain a 
line for each Component of each Alternative of 
each Definiens in the Syntax Specification. Each 
line of the Structure Table will consist of four 
items : 

TYPECODE, an integer item, will contain 
line numbers referring to the Type Table ; 

STRUCT, a yes-no item; 

SUCCESSOR and 

ALTERNATE, integer items, will contain 
line numbers referring to other lines of the 
Structure Table, plus two special codes 
denoted by “OK” and “FAIL.” 

The Syntax Structure Table is constructed ac- 
cording to the following rules : 

Consider first a Defined Type which has no 
left-recursive Alternative in its Definiens. Re- 
serve a block of entries in the Structure Table. 
Assign an entry in the Structure Table to each 
Component in each Alternative in the Definiens. 
In the Type Table line corresponding to this 
Defined Type — say, t — set LOOKFOR [f] to the 
Structure-Table line number assigned to the 
first Component of the First Alternative of the 
Definiens. In each Component-line s, set TYPE- 
CODE [s] to the Type Table line number of the 
Syntactic Type which occurs as that Compo- 
nent in the Definiens. In each line correspond- 
ing to a Component which is not the last Com- 
ponent of an Alternative, set STRUCT to “No” 
and SUCCESSOR to the line corresponding to 
the next Component. In each line correspond- 
ing to a Component which is the last Component 
of an Alternative, set STRUCT to “Yes” and 
SUCCESSOR to “OK.” In each line corre- 
sponding to a first Component of any Alterna- 
tive except the last Alternative of the Definiens, 
set ALTERNATE to the line corresponding to 
the first component of the next Alternative. Set 
all other ALTERNATE fields to “FAIL.” 

If the Defined Type contains a left-recursive 
Alternative: (we shall here assume there is 


only one left-recursive Alternative — See Ap- 
pendix). Set the left-recursive Alternative 
aside temporarily, and carry out the above 
process for the other Alternatives. Then: 

Assign a Structure-Table line to each Compo- 
nent of the recursive Alternative except the 
recursive Component itself. 

Set TYPECODE in each of these lines, as 
above. 

Set SUCCESSOR and STRUCT in each of 
these lines, except for the last Component, as 
above. 

Call the first of these lines (the one corre- 
sponding to the Component which immediately 
follows the recursive Component in the De- 
finiens) the “handle.” 

Set ALTERNATE in each of these lines, ex- 
cept the handle, to “FAIL.” 

Set ALTERNATE in the handle to “OK.” 

Set SUCCESSOR in the line for the last 
Component of this Alternative to the handle, 
and set STRUCT in this line to “Yes.” 

Now, in the lines corresponding to last Com- 
ponents in all the other Alternatives in this 
Definiens, SUCCESSOR will have been set to 
“OK” by the nonrecursive treatment. Replace 
each of these “OK”s by the line number of the 
handle. 

The Syntax Type Table and the Syntax Struc- 
ture Table corresponding to Syntax II are 
shown together as Table 2. In the hope of 



Figure 1. Flow Chart for ANALYZE. 
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reducing the confusion involved in having en- 
tries in each table pointing to the other, wf 
have indicated the indexing of the Syntax Type 
Table with Roman numerals, and the indexing 
of the Syntax Structure Table with Arabic 
numerals, and we have added the names of the 
syntactic types corresponding to the lines of 
the Syntax Type Table. 

The Analyzer Algorithm 

The flow chart. Figure 1, illustrates an 
Analyzer working from Syritax Tables of the 
sort we have just constructed. The following 
remarks will help to follow the flow chart. 

— The global quantities GOAL, SOURCE, and 
CHAR are used as follows : 

GOAL is the line number in the Syntax 
Type Table corresponding to the Syn- 
tactic Type currently being considered 
(initially, the Starting Type) . 

SOURCE is the line number in the Syntax 
Structure Table of the Component cur- 
rently being considered (initially un- 
defined, hence set to zero) . 

CHAR is the (ordinal) number of the 
character position in the Input String 
next to be considered (initially set to 
the beginning of the String, here 1). 

— The operations of “Pushdown” and “Pop- 
up” are performed on these globals — this may 
be done by any appropriate mechanism. For 
definiteness, let us assume an index Y (initially 
zero) accessible to these two operations, and 
arrays GYOYO, SYOYO, and CYOYO. Then 
(in quasi ALGOL notation). 

Pushdown: Y := Y -1- 1 

GYOYO [Y] :=GOAL; 
SYOYO [Y] := SOURCE-; 
CYOYO[Y] :=CHAR; 

Popup: GOAL := GYOYO [Y] ; 

SOURCE := SYOYO [Y] ; 
if CHAR is mentioned in the call 
then CHAR :- CYOYO[Y]; Y 
:= Y - 1; 

Plausibly, CHAR is popped up when an Alter- 
native has failed, and the Analyzer must back 
up to the beginning of that Construction and 
try another Alternative at the same place ; and 
CHAR is not popped up — Whence left as it has 


been set by the successful recognitions — when 
some Alternative has succeeded. 

— Recognize is assumed as described earlier : 
It returns a success/failure indicator which is 
tested in the “Found?” box. For definiteness 
again, we shall assume that, when it succeeds 
in recognizing a Terminal Class, it places a 
Symbol-Table or Literal-Table line number in 
some global location, for the Generator to use. 

— The following sections of this paper will 
discuss possible uses to be made of the Ana- 
lyzer’s results. The routine which considers 
these results is named the “Generator,” and it 
is represented in this flow chart by a subroutine 
call box: “Generate.” When Generate is called, 
the value of SOURCE uniquely indicates the 
Syntactic Type which has been recognized and, 
moreover, the particular Alternative in the Defi- 
nition of that Syntactic Type which has just 
succeeded. The column headed “Corresponds 
to Definitions” has been added to the Syntax 
Structure Table to indicate this relationship. 
The numbers in this column correspond to the 
Alternatives in the “semantics” tables. Tables 
4 and 5. 

DOING SOMETHING USEFUL WITH THE 
ANALYSIS 

A syntactic analysis, such as that depicted 
verbally in the preceding section or via the flow 
chart in Figure 1, is an important part of the 
problem which must be solved by a compiler, 
but it is only a part. The goal of a compiler is, 
after all, to produce the coding required to 
carry out the procedure described in the pro- 
gramming language being compiled. This cod- 
ing might be desired as actual machine instruc- 
tions for some computer or it might be desired 
as instructions appropriate for some interpreter 
available on one or more machines or it might 
be desired in some other form. In any event, 
some further processing is required once the 
syntactic analysis is complete in order to “gen- 
erate” and format the coding to be output. 

Let us suppose that the syntactic analysis has 
proceeded to the point where some syntactic 
type has been recognized (in the flow chart. 
Figure 1, we have passed through the “GENER- 
ATE” box). The contents of SOURCE tells us 
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TABLE 3 
“Interpretive Semantics” 

for Syntax II 

1.1 <program> 

; : = < assignment > 

(Execute the <assignment> then halt} 

1.2 

1 <assignment> ; <program> (Execute the <assignment> then pro- 
ceed} 

2.1 <assignment> 

: := <variable> = <arith expr> 

(“Locate"’ the <variable> (determine 
its address for later assignment of 
value) ; then evaluate the <arith 
expr> ; then assign its value to the 
< variable >} 

3.1 <arith expr> 

: := <term> 

(Evaluate the <term> ; the value of the 
<arith expr> is this value} 

3.2 

1 <arith expr> -f <term> 

(Evaluate the <term> and then the 
<arith expr>; the value of the (de- 
fined) <arith expr> is the sum of 
these two values} 

4.1 <term> 

: := < factor > 

(Evaluate the <factor> ; the value of 
the <term> is this value} 

4.2 

1 <term> * <factor> 

(Evaluate the <term> and then the 
<factor> ; the value of the (defined) 
<term> is the product of these two 
values} 

5.1 <factdr> 

: := < variable > 

(The value of the < factor > is the value 
of the <variable>} 

5.2 

1 <integer> 

(The value of the <f actor > is the value 
of the < integer >} 

5.3 

1 (<arith expr>) 

(Evaluate the <arith expr> ; the value 
of the <factor> is the value of the 
<arith expr>} 

6.1 <variable> 

q/ 

(The value of the <variable> is the 
value most recently assigned to the 
variable Q/} 

7.1 < integer > 

0 

(The value of the < integer > is the 
value of the integer (j (according to 
the conventional representation of in- 
tegers) } 


which syntactic type has been recognized as 
well as which alternative construction of the 
syntactic type was built. Thus, some action or 
set of actions could be initiated at this point to 
process this syntactic type in a variety of ways. 

For example, Table 3 gives for each alterna- 
tive construction of the syntactic types of Syn- 


tax II a verbal description of the computations 
to be performed upon recognition of that con- 
struction. Corresponding to this table, the anal- 
ysis of the assignment statement 
X = NU* (Y + 15) 

could yield the following fragments of (slightly 
edited) verbal description: 
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Line 

Construction 

6.1 

< variable > 


<v 

6.1 

< variable > 

: := 

(V 

5.1 

<factor> 

: := 

<variable> 

4.1 

<tenn> 

: :z= 

< factor > 

6.1 

< variable > 


0/ 

5.1 

< factor > 

: 

< variable > 

4.1 

<tenn> 

: := 

< factor > 

3.1 

<arith expr> 

: : = 

<term> 

7.1 

< integer > 

: := 

0 

5.1 

< factor > 


< integer > 

4.1 

<term> 

: := 

< factor > 

3.1 

<arith expr> 

• •“ 

<arith expr> 
-|-<term> 

5.3 

< factor > 

: := 

(<arith expr>) 

4.1 

<terin> 

: := 

<term> 

* < primary > 

3.1 

<arith expr> 

: 

<term> 

2.1 

< assignment > 

:: = 

<variable> = 
<arith expr> X 


Source Description of Computation 

X The value* of the <variable> 

is the value most recently as- 
signed to the variable X. 

NU The value of the < variable > is 

the value most recently as- 
signed to the variable NU. 

NU The value of the < factor > is 

the value of NU. 

NU The value of the <term> is the 

value of NU. 

Y The value of the <variable> is 
the value most recently as- 
signed to the variable Y. 

Y The value of the <factor> is 
the value of Y. 

Y The value of the <tQrm> is the 
value of Y. 

Y The value of the <arith expr> 
is the value of Y. 

15 The value of the < integer > is 

15. 

15 The value of the <factor> is 

15. 

15 The value of the <term> is 15. 

Y -|- 15 The value of the <arith expr> 

is Y -|- 15. 

(Y 4- 15) The value of the <factor>, 

(Y + 15) is Y + 15. 

NU*(Y + 15) The value of the <term>, 
NU*(Y + 15) is NU*(Y + 15). 

NU* (Y + 15) The value of the <arith expr> 
NU*(Y-fl5) is NU*(Y-|-15). 

NU* (Y -f- 15) Assign the value of NU* ( Y -j- 
15) to the variable X. 


* Obviously, the current value of the variable X is not of interest here since the purpose of the assignment is to 
assign a value to X. 


Thus, tvith a certain amount of editing, the 
recognition of X = NU* (Y -f- 15) yields the 
verbal description : 

“Let NU and Y represent the values most 
recently assigned to the variables NU and 
Y ; then compute NU* (Y + 15) and assign 
the resulting value to the variable X.” 


Table 4 illustrates a very simple approach to 
the problem of machine code synthesis. With 
each syntactic construction is associated a set 
of actions. These actions are of tvyo types — 
output and set. The interpretation of the ac- 
tions is reasonably obvious. The bracketed 
numerals under the components of a construe- 
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TABLE 4 

Machine Code Semantics for Syntax II — Direct Generation 


1.1 <program> ::= 

12 

<assignment> 

[1] 

1 <assignment> ; <program> 
[1] [2] [3] 

1. 

Output END 



2.1 <assignment> 

< variable > = <arithexpr> 

1. 

Output CLA (addr ) 


[1] [2] [3] 


[3] 



2. 

Output STO addr 




[1] 

3.1 <arith expr> : := 

<term> 

1. 

Set addr = addr 


[1] 


[ ] [1] 

3.2 

I <arith expr> + <term> 

1. 

Output CLA (addr ) 


[1] [2] [3] 


[1] 



2. 

Output ADD (addr ) 




[3] 



3. 

Output STO (addr ) 




[ ] 

4.1 <term> 

< factor > 

1. 

Set addr = addr 


[1] 


[ ] [1] 

4.2 

1 <term> * <factor> 

1. 

Output LDQ (addr ) 


[1] [2] [3] 


[1] 



2. 

Output MPY (addr ) 




[3] 



3. 

Output STQ (addr ) 




[ ] 

5.1 <factor> : := 

<variable> 

1. 

Set addr = addr 


[1] 


[ ] [1] 

5.2 

1 <integer> 

1. 

Set addr = addr 


[1] 


[ ] [1] 

5.3 

1 (<arith expr> ) 

1. 

Set addr = addr 


[1] [2] [3] 


[ ] [2] 

6.1 <variable> 

n/ 

1. 

Set addr = the variable name 




[ ] 




recognized at this point of the input 
string. 

7.1 <integer> 

0 

1. 

Set addr = a symbolic name for the 


[ ] 

address in which will be kept the 
integer constant recognized at this 
point in the input string. 
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tion identify the components. The notation 
addr means the result address associated 
[ 1 ] 

with the component identified by [1] in the 
syntax specification. A bracketed blank repre- 
sents the syntactic type being defined and 
addr represents the result address of this 
[ ] 

type; when the addr appears in a machine 

[ ] 


instruction a temporary storage register is to 
be assigned. In the example below we use the 
notation tj for the j*'’ such temporary. Again 
consider the assignment 

X = NU*(Y -H 15). 

If this assignment is analyzed and actions car- 
ried out as per Table 4, the following results: 


Line 


Construct 

Source 

Result 

6.1 

< variable > 


n/ 

X 

addr |.<^ariable>] ~ ^ 

6.1 

< variable > 


<v 

NU 

addr [<variable>] “ 

5.1 

<factor> 


< variable > 

NU 

= NU 

4.1 

<term> 


<factor> 

NU 

addr = NU 

6.1 

<variable> 


(V 

Y 

addr |■<v»^iable>] ~ ^ 

5.1 

< factor > 


< variable > 

Y 

addrj<,„,to,>,= Y 

4.1 

<term> 

: := 

<factor> 

Y 

addr(<t^^„>3= Y 

3.1 

<arith expr> 

: 

<term> 

Y 

addrf<.ri^^„,>3= Y 

7.1 

< integer > 


0 

15 

addrj<.„,^„>,= 15 

5.2 

<factor> 

: 

<integer> 

15 

addrj<j„^,>j= 15 

4.1 

<term> 


<factor> 

15 

addr(<term>]= 

3.1 

<arith expr> 


<arith expr> 

-f <term> 

Y -f- 15 

CLA Y 

ADD=15 

STO ti 

5.3 

<factor> 

: := 

( <arith expr> ) 

(Y-hl5) 

addr«,.,^,>j= ti 

4.1 

<term> 

« . — 

<term> *.<factor> 

NU*(Y + 15) 

LDQ NU 

MPY ti 

STQ ta 

3.1 

<arith expr> 

: := 

<term> 

NU*(Y-K 15) 

addr expr>] — ^2 

2.1 

< assignment > 


<variable>= 
<arith expr> 

X = NU*(Y -f- 15) 

CLA ta 


STO X 


Given this mechanism, which we shall refer 
to in the sequel as the “Direct Generation 
Mechanism,” plus some mechanism for creating 
and housekeeping local or internal labels and a 


sufficiently sophisticated assembler (e.g., it can 
allocate memory for constants and variables) 
we have a rudimentary compiler. 
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1.1 <program> 

1.2 

2.1 <assignmeiit> 

3.1 <arith expr> 

3.2 

4.1 <term> 

4.2 

5.1 <factor> 

5.2 

5.3 

6.1 < variable > 

7.1 <integer> 


TABLE 5 

Machine Code Semantics for Syntax II- 
; := <assignment> 

[ 1 ] 

1 <assignment> ; <program> 
[1] [2] [3] 

: := I <variable> = <arith expr> 
[1] [2] [3] 


::= <term> 

[ 1 ] 

I <arith expr> -}- <term> 
[1] [2] [3] 


: := <factor> 

[ 1 ] 

I <term> * <factor> 
[1] [2] [3] 


: <variable> 

[ 1 ] 

I < integer > 

[ 1 ] 

I (<arith expr>) 
[1] [2] [3] 

q/ 


:;= 0 


-Deferred Generation 


1. 

Output END 


1. 

Process [1] 


2. 

Process [3] 


3. 

Output CLA addr 

[3] 

4. 

Output STO addr 

[1] 

1. 

Process [1] 


2. 

Set addr = addr 



[ ] [1] 

1. 

Process [1] 


2. 

Process [3] 


3. 

Output CLA addr 

[1] 

4. 

Output ADD addr 

[3] 

5. 

Output STO addr 

[ ] 

1. 

Process [1] 


2. 

Set addr = addr 



[ ] [1] 

1. 

Process [1] 


2. 

Process [3] 


3. 

Output LDQ addr 

[1] 

4. 

Output MPY addr 

[3] 

5. 

Output STQ addr 

[ ] 

1. 

Process [1] 


2. 

Set addr — addr 



[ ] [ 1 ] 

1. Process [1] 

2. Set addr = addr 

[ ] [ 1 ] 

1. Process [2] 

2. Set addr = addr 

[ ] [2] 

1. Set addr = the variable name rec- 

[ ] 

ognized at this point of the input 
string. 

1. Set addr = a symbolic name for the 

[ ] 

address in which will be kept the 
integer constant recognized at this 
point in the input string. 
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There is an interesting variation in this 
method of generating machine code. Let us 
suppose that, for one reason or another, it is 
desirable to complete the analysis for an entire 
<assignment> and produce the tree represen- 
tation of its syntax, and then to generate the 
machine coding which is to correspond to the 
tree so constructed. Table 5 illustrates this 
approach. It is essentially Table 4 to which 
some further actions have been appended. In 
this table the action denoted by a bracketed 
numeral preceded by the word ‘'process’’^ is 
interpreted : ‘‘do the actions for the component 
indicated,’’ Thus, again given the assignment: 

X = NU* (Y + 15) 

an analysis of this assignment with respect to 
Syntax II could be carried out resulting in the 
tree of Figure 2. Given this tree, the actions 
indicated in Table 5 could result in the follow- 
ing: 



V 

(x] 



<tenn> 

© I 


<£actoi> 

© I 


<variablp> 


© 


ir 

[NU] 


<foctoi> 



( <antl) oxpr> ) 


f(l 



© / 

© 


<arlth expt> 


<tenn> 

© 1 

[■* 

’ © ! 

<tenn.' 


<£5ctor> 

r 

® 1 


© 1 

<facior> 


<lncegar 

® 1 


@ 1 

<vartable> 


3 

® ! 


[15] 




Figure 2. Syntax Tree for the <Assignment> 
“X = NU* (Y 4- 15).” 


Tree Node 

Line of Table 5 

Result 

25 

2.1 

Process < variable >, node 2. 

2 

6.1 

Set addr( 2 ) = X ; actions complete for node 2, return to node 25. 

25 

2.1 

Process <arith expr>, node 24. 

24 

3.1 

Process <term>, node 23. 

23 

4.2 

Process <term>, node 7. 

7 

4.1 

Process <factor>, node 6. 

6 

5.1 

Process <variable>, node 5. 

5 

6.1 

Set addr(.,) = NU ; actions for node 5 complete ; return to node 
6. 

6 

4.1 

Set addr(B) = NU ; actions for node 6 complete; return to node 
7. 

7 

4.1 

Set addr, 7 ) = NU ; actions for node 7 complete; return to node 
23. 

23 

4.1 

Process <factor>, node 22. 

22 

5.3 

Process <arith ex?pr>, node 20. 

20 

3.2 

Process <arith expr>, node 14. 

14 

3.1 

Process <term>, node 13. 
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Tree Node 

13 

Line of Table 5 

4.1 

Resvlt 

Process <factor>, node 12. 

12 

5.1 

Process <variable>, node 11. 

11 

6.1 

Set addr(ii) = Y ; actions for node 11 complete; return to node 
12. 

12 

5.1 

Set addr(i 2 ) = Y ; actions for node 12 complete; return to node 
13. 

13 

4.1 

Set addr(i 3 ) =: Y ; actions for node 13 complete ; return to node 
14. 

14 

3.1 

Set addr(i 4 ) = Y ; actions for node 14 complete; return to node 
20. 

20 

3.2 

Process <term>, node 19. 

19 

4.1 

Process <factor>, node 18. 

18 

5.2 

Process <integer>, node 17. 

17 

7.1 

Set addr(i 7 ) = =15; actions complete for node 17, return to 
node 18. 

18 

5.2 

Set addr(i8) =15; actions complete for node 18, return to 

node 19. 

19 

4.1 

Set addr(i 9 ) = =15; actions for node 19 complete; return to 
node 20. 

20 

3.2 

Output CLA Y 

Output ADD =15 

Output STO ti 

Set addr( 20 ) = ti ; actions for node 20 complete; return to node 
22. 

22 

5.3 

Set addr, 22 ) = ti; actions for node 22 complete; return to node 
23. 

23 

4.2 

Output LDQ NU 

Output MPY ti 

Output STQ t 2 

Set addr,23) = tj; actions for node 23 complete; return to node 
24. 

24 

3.1 

Set addr( 24 ) = t 2 ; actions for node 24 complete; return to node 
25. 

25 

2.1 

Output CLA t 2 


Output STO X 

Set addr^ 2 r>) = X ; actions for node 25 complete ; exit. 


Note that we have changed notation slightly 
and used the notation addr,r,), for example, to 
indicate the (result) address which is as- 
sociated with node 5. 


The “code generation'' mechanism implied 
by the above description is as follows: At any 
point in time some action of some node is to 
be performed ; the actions and their interpreta- 
tions are : 
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Action 

Process a node 


Output code 


Set address 


End of actions 


Example Interpretation 

Process [1] Remember which action of the current node is being 

processed; perform the first action of the indicated 
node. 


Output CLA addr 

[ 1 ] 


Set addr 

[ ] 

addr 

[ 2 ] 


Procure the result address from the node indicated 
(the node corresponding to addr in the example) 

[ 1 ] 

and output the instruction. If the address indication 
is blank (e.g., STO addr ) select the next available 

[ ] 

temporary register address; set that address as the 
result address of the current node. 


Set the result address of the current node as the result 
address of the node indicated. 


Return to the next action of the node which ‘‘called'' 
for this node to be processed. 


Thus, the mechanism has a “control" element 
which is, at any point in time, considering some 
node. When a new node is to be processed, the 
“control" remembers which action to proceed 
with when “control" returns to the current 
node and then initiates the first action of the 
node to be processed. When all actions for a 
node are processed, “control" returns to the 
node which called for the current node and 
takes up actions for that node where it left off. 
Further, the mechanism has the ability to out- 
put code, associate result addresses with the 
node being processed, and procure temporary 
register addresses. 

Again given this mechanism, which we shall 
refer to in the sequel as the “Deferred Genera- 
tion Mechanism," plus some mechanism for 
creating and housekeeping local or internal 
labels and a sufficiently sophisticated assembler, 
we have a rudimentary compiler. 

There are, of course, other kinds of “actions" 
one could associate with a node. For example, 
it would be quite straightforward to associate 
actions for producing a different tree structure 
than the complete syntax tree as depicted in 
Figure 2. This might then produce, from an 
analysis of the assignment 

X = NU*(Y + 15) 

the simple tree (or “Polish prefix" representa- 
tion) : 



It will be apparent that the Direct Generation 
Mechanism does not require that the complete 
syntactic tree actually be built as the analysis 
proceeds. Rather, it is sufficient that there be 
some means (for example, a push down store is 
adequate) for “remembering" the result ad- 
dresses which have yet to be “used." Further, 
while this technique appears, on the face of it, 
to be quite rapid and efficient (no tree need be 
kept, shorter “driving tables" — compare Table 
4 with Table 5) it is subject to some serious 
limitations. In particular, since the coding for 
a syntactic type is actually being output as 
that type is recognized, there must (for most 
languages) be some mechanism for “erasing" 
a patch of code generated for a syntactic type 
recognized while the analyzer was attempting 
recognition of some larger construction when 
it turns out that the syntactic type in question 
does not enter into the construction of the 
larger syntactic type as it is finally built. 
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In the above example with the Deferred Code 
Mechanism we used < assignment > as the syn- 
tactic type over which (i.e., over the tree repre- 
sentation of which) generation was to occur. 
It is, of course, possible to generalize this to 
allow any syntactic type to be “tagged” as big 
enough to call the generation mechanism. Thus, 
at one extreme a complete program would have 
to be recognized before any generation was per- 
formed (a “two pass” compiler) and, at the 
other extreme, each syntactic t 3 T)e would call 
for generation (“one pass” compiler) thus 
making the Deferred Generation Mechanism 
essentially the same as the Direct Generation 
Mechanism. It should be noted that, employing 
the Deferred Generation Mechanism, once the 
tree corresponding to some syntactic type has 
been processed (“generated over”) it can be 
erased with the exception of its top-most or 
root node which may have to remain to supply 
a “result address” for the generation over some 
larger construction. 

It must be emphasized that both these 
mechanisms are very rudimentary and for use 
within a compiler would require some embel- 
lishment in order to be practical. Thus, for 
example, it seems rather a shame to generate 
a complete syntax tree for some (perhaps frag- 
ment of some) program and then make essen- 
tially no use of the contextual information con- 
tained implicitly in the tree structure. Indeed, 
a rather simple addition to make some use of 
this information would be the following: con- 
sider that we add conditional actions of the 
following sort : 

IF addr = addr , SKIP n 
[a] [b] 

IF addr ^ addr , SKIP n 
[a] [b] 

SKIP n 

where in the first two the truth of the relation 
causes the n actions following to be skipped and 
the SKIP n action causes the n actions follow- 
ing to be skipped. If we further add “AC” and 
“MQ” as special values for addr , then for 

[ ] 

a single address computer (say, like the IBM- 
7094), it would be possible to generate rather 
more efficient coding by placing results tem- 
porarily in the accumulator (AC) or multiplier 
quotient (MQ) registers and then checking for 


the use of these locations for operands before 
generating coding. Thus we might then as- 

sociate with 

the construction <arith expr> 

<arith expr> -f- <term> in Table 5 the 

actions : 


1. 

Process [1] 

2. 

Process [3] 

3. 

IF addr = AC, SKIP 4 


[3] 

4. 

IF addr = AC, SKIP 1 


[1] 

5. 

Output CLA addr 


[1] 

6. 

Output ADD addr 


[3] 

7. 

SKIP 1 

8. 

Output ADD addr 


[1] 

9. 

Set addr = AC 

r 1 


[ ] 


These would have the effect of preserving 
results of additions in the accumulator and 
remembering that they were there in order to 
avoid redundant store-load instructions. In 
order to fully utilize such a facility, including 
keeping track of the MQ contents as well, some 
further mechanism for indicating the AC or 
MQ are empty or full and some mechanism for 
storing their contents would be required. The 
basic scheme is, however, reasonably clear from 
the example. The MAD '^’ranslator has facilities 
similar to these built into its code generation 
mechanism. 

Even such a mechanism as this barely makes 
use of the rather rich store of contextual in- 
formation available. In order to do so, how- 
ever, we would require some means for talking 
about the nodes of the tree relative to any given 
node of interest (such as a nodes “father,” 
“father’s father,” “father’s right-brother’s 
son’s son,” and so on). Further, it would prob- 
ably be desirable to extend the “control” some 
what and allow more general “tree walks” than 
simply processing “sons” and returning to 
“father.” Also, if contextual information were 
gathered, it would have to be “parked” some- 
where and thus an addition of further informa- 
tion fields associated with each node would be 
useful plus, perhaps, some “working variables” 
which various actions could reference and set. 



50 


PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 


It is clear that one could extend the whole gen- 
eration mechanism to include control sequenc- 
ing and actions which were programs in a 
rather specialized programming language. The 
paper by Warshall and Shapiro [^] contains a 
brief sketch of one such extension which has 
been successfully tried on several computers 
and for several languages. 

SOME PROBLEMS AND LIMITATIONS 

The techniques and fragments of techniques 
which we have discussed above are all, in one 
way or another, relevant to the design and con- 
struction of compilers. Furthermore, these tech- 
niques, in the simplified form in which we have 
presented them, are of sufficient general utility 
that they should be in every ‘‘systems'’ program- 
mer’s bag of tricks. For example, the ANA- 
LYZE algorithm — coupled with either the Di- 
rect or Deferred Generation Mechanism dis- 
cussed in the preceding sections — can be applied 
to a variety of programming tasks imbedding 
simple algebraic facilities in an assembly pro- 
gram, handling the “translation” of free format 
or formatted control cards, interpreting de- 
scriptions of formatted data, and so on. How- 
ever, for the serious construction of compilers 
these techniques represent only a few of the 
techniques required and they are subject to 
some limitations. 

Some of the considerations which must enter 
into any compiler design and which are affected 
to one degree or another by the choice of 
method of analysis and its linkage to the gen- 
eration of code are the following : 

— error analysis and recovery 
— analysis of languages in which recogni- 
tion order is not the same as genera- 
tion order 

— processing declarations 
— generation of highly efficient coding 

Let us consider these questions. 

Error Analysis and Recovery 

An error is detected, for example, in Syntax 
II, when the analyzer cannot recognize a <pro- 
gram>. Although the exact point in the input 
string past which recognition fails will be 
known, it is extremely difficult to determine 


exactly why the error occurred and to, in a 
general way, devise means for recovery. 

Several schemes exist for dealing with this 
problem, notably : 

1) A scheme which permits specification of 
“no back up” on certain constructs. For ex- 
ample, in Syntax II, no back up on recognition 
of or “(” could help isolate the reasons 
for a failure. 

2) A scheme due to E. T. Irons [5] which, in 
effect, carries along all possible parses of an 
input string. 

3) Special “error” syntactic types which 
could be defined in the syntax. 

At the present time there is no completely 
satisfactory scheme for dealing with syntactic 
errors discovered in the course of predictive 
analysis. If the programming language which 
is being analyzed has sufficiently simple struc- 
ture that it is a precedence grammar, the tech- 
nique of bounded context analysis is probably 
a better technique to utilize. A discussion of 
precedence grammars is given in Reference 
[6] ; the use of bounded context analysis is de- 
scribed in Reference [7] . 

Recognition Order Differs from Generation 
Order 

Some reasons why the order of generation 
might be different from the order of recognition 
are: 

1) The detection of, and generation of coding 
for, sub-expressions which are common to two 
or more parts of the program is desired. 

2) The detection of computations which are 
invariant in some larger computation (for ex- 
ample within loops) is desired. 

3) Languages other than the usual program- 
ming languages are being translated, for ex- 
ample, data description languages or the com- 
putational fragments associated with TABSOL- 
like descriptions are to be processed. 

Reference [1] describes some techniques for 
coping with these problems in a compiler which 
uses predictive analysis. 
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Handling Declarations 

Here the problem is that the “actions” are 
not to generate coding (usually) but to change 
the syntax — normally through type coding in- 
formation inserted into a symbol table. For- 
mally, however, a declaration of type is really 
the appending of a syntax rule. Thus the 
ALGOL 60 declaration 

“real X, Y;” 

means that the two new syntax rules 

“<real var> : :=X” 
and 

“<real var> : :=¥” 
must be appended to the syntax. 

Other declarations may cause changes to the 
full compiler — for example, debug mode decla- 
ration, and the like. 

Generation of Highly Effleient Coding 

This cannot be accomplished by generating 
code directly as the analysis is performed since 
common sub-expressions, invariant computa- 
tions and the like couldn’t be detected reason- 
ably and special registers such as index 
registers certainly couldn’t be allocated on any 
global basis, which is necessary if any really 
effective use is to be made of them. 

Many of the manipulations required to collect 
the information pertinent to optimizing code 
are not particularly easily done (or, at least 
efficiently done) with the source material in a 
syntax tree form. Reference [11 describes a 
method by which such optimizations are per- 
formed over a set of “macro-instructions” 
which are “generated” by a technique similar 
to that depicted by Table 5. 

SUMMARY AND CONCLUSION 

In this paper we have tried to explain the 
workings of syntax-directed compiling tech- 
niques — or perhaps better, of those parts of a 
compiler in which the actions to be performed 
can reasonably be associated with the structure 
of the input string, A satisfying understanding 
of the operation of a syntax-directed analyzer 
can only be attained by actually playing through 
a few examples. We recommend this as a worth- 


while experience to anyone who is interested, 
and so we have given a sufficiently detailed 
description of a particular example to permit 
the reader to write statements of his own in 
the simple language, and play them through the 
Analyzer and any one of several code-genera- 
tion techniques. 

There remains the question of evaluating syn- 
tax directed compiler techniques in comparison 
to other approaches. 

On the face of it, syntax directed analyzers 
cannot be as efficient as operator-^precedence 
techniques for the simple task of recognizing 
input structures. This follows from the fact 
that, no matter how cleverly the language de- 
signer, or specifier, arranges the elements of 
his Syntax Specification, the Analyzer will 
necessarily spend some percentage of its time 
exploring blind alleys. Clever specifications can 
make the blind alleys less frequent and shorter, 
but even for the simplest of languages, there 
will be some. 

Thus, in any situation where the primary 
consideration is speed of the compiler itself, 
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suitable. But this, we argue, is true only if the 
quality of the coding produced is also of rela- 
tively little importance. In our experience with 
attempts to generate highly efficient optimized 
coding for several different machines, we find 
that the time spent in analyzing is a small 
fraction of the total — even using very sloppy 
Syntax Specifications, The most important 
question in compiling system design today, we 
reiterate, is not the “understanding” of the 
source language — that is a solved problem — but 
rather the generation of really good object- 
language coding. 

One of the principal arguments in favor of 
syntax-directed techniques is that it is very 
easy to change the specification of the language, 
or, indeed, to switch languages, merely by 
changing the Syntax Tables — ^no modifications 
of the algorithms are required. And this is in 
fact true, with some restrictions. But now that 
techniques exist for automatically producing 
operator-precedence tables from a Syntax Speci- 
fication [6], the syntax-directed compilers no 
longer have a monopoly on this useful feature. 
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A further advantage of syntax-directed 
analysis remains, up to the present, only poten- 
tial. These techniques are evidently not of the 
‘‘bounded context” sort— a syntax directed 
Analyzer can take into account as large a con- 
text as required to perform its recognition 
(admittedly, at a definite cost in speed) . Hence, 
when the day comes that we need to perform 
analysis of source languages of much less rigid 
structure, syntax-directed techniques will be 
more immediately applicable than the tech- 
niques which are designed to take advantage 
of the restrictive properties of present pro- 
gramming languages. 

In summary, syntax-directed techniques have 
a definite place in today's computing world, and 
promise to play an even more important role in 
the future. 

APPENDIX 

Some Further Traus formations of the Syntax 
Specification and the Syntax Tables 

In constructing the Syntax Tables, we de- 
scribed a complicated operation for avoiding 
the problem of left-recursive Alternatives in a 
Syntactic Type Definition. We can describe this 
as a transformation within the Syntax Specifi- 
cation itself, and, at the same time, include 
some features which improve the efficiency of 
the encoding of the Syntax. 

First, we extend the idea of “Component” to 
include two new forms : 

1) An Alternation (of one or more Construc- 
tions), enclosed in square brackets ‘[' and 
These brackets are assumed to be different 
from any of the Terminal Characters (if they 
were not, we'd use some other signs) . 

2) An Alternation enclosed in braces and 
again assumed different from any of the 

Terminal Characters. 

Second, we apply a left-distributive law: 

(T2) Whenever, within a single definition, 
two (or more) Alternatives start with the same 
Component (or sequence of Components), re- 
place all of these Alternatives with a single one, 
whose last Component is the bracketed Alterna- 


tion of the non-common parts of the original 
Alternatives, and whose first Component (s) is 
(are) the one(s) common to the original Al- 
ternatives. It is also useful to introduce the 
idea of a “null” Syntactic Type — effectively a 
Terminal Type which is always recognized 
whenever it is called for — denoted here by a 
capital lambda. Then, for example: 

<a>::=<b> <c> <d> <e> | 

<b> <c> I <b> <c> <f> i <g> 

would be transformed into : 

<a>::z=<b> <c> [ <d> <e> | 

<f> I A] I <g> 

(Obviously, if the Analyzer is going to consider 
Alternatives in the order in which they are 
written, a null Alternative should always ap- 
pear last in an Alternation.) 

Having applied (T2) to any Definition, there 
can be at most one left-recursive Alternative, 
and if there is one, we can rewrite the definition 
according to : 

(T3) Put the left-recursive Alternative as 
the last Alternative in the definition ; if there is 
more than one other Alternative, put square 
brackets around the Alternation consisting of 
the nonrecursive Alternatives; delete the sign 
‘ ' preceding the last Alternative, and delete the 
first Component of that Alternative (which will 
be the same as the Defined Type of that Defini- 
tion) ; then enclose the remaining Components 
(of this formerly last Alternative) in braces. 

Thus the ultimate transform of a left-recur- 
sive definition has as Definiens a single Con- 
struction, the last Component of which is 
“iterated” (enclosed in braces) . As an example, 
a Definition which was originally: 

<a>::zz:<b> <c> | <a> <d> <e> | 
<a> <f> I <g> <h> 

would be transformed into : 

<a>::=: [<b> <c> | <g> <h>] 
{<d> <e> I <f>} 

The modifications to the rules for constructing 
the Syntax Tables to represent Definitions in 
this form is left as an exercise for the reader. 
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The Analyzer flow-charted in Figure 1 should 
work on the resulting tables. 

Three of the Definitions in Syntax II would 
be changed by application of (T2) and (T3) : 

<program> : :=<assignment> 

[ ; <prograni> | A] 

<arith expr> ::= <tenn> 
{+<term>} 

<term>::= <factor> {* <factor>} 

Now, an analogous pair of transformations — 
a right-distributive law, and the elimination of 
right-recursive Constructions — could be ap- 
plied, and this would render the Syntax Specifi- 
cation still more compact. The language used 
by Brooker and Morris [10] for specifying 
syntax is essentially one of this sort, although 
the notation used is rather diiferent. 

More ''Groundedness*' Problems 

The treatment we have described takes care 
of left-recursive definitions, as long as the re- 
cursion occurs within the definition of a single 
Syntactic Type. It will not handle the infinite- 
loop problem engendered by, as an example : 

<a>::=<b> Z | X 
<b>:::zz<a> Z | Y 

and it is in general true that, for an Analysis 
technique of the ‘'top down’’ sort, as presented 
here, a Syntax Specification with such left- 
recursive loops will not be adequate. This leads 
to the requirement of an additional condition 
on Syntax Specifications: If we say that a 
Construction is “firmly grounded” when its 
first Component is either a Terminal Type or a 
firmly grounded Defined Type, and a Defined 
Type is firmly grounded when all of its non-left- 
recursive Definientes (in the Basic Syntax 
Specification) are firmly grounded, then: 

(C4) Every Defined Type must be firmly 
grounded. 

In practice, this is not a serious restriction. 
The simplest test for this condition is to try to 
run the Analyzer — it stops requesting input and 
goes into a loop. It is usually a simple matter to 
rewrite the Syntax Specification to eliminate 


the difficulty. In the above example, this could 
be done in several ways, one of which is : 

<a>::— <b> 

<b>::= [X| Y] {Z } 

A Modified Analyzer 

The Analyzer algorithm of Figure 1 is de- 
signed to call the Generator upon recognition 
of every instance of a Syntactic Type, even if 
it is not the “longest” instance of that type 
present at the given position of the Input string. 
It turns out to be the case that, for all the 
standard programming languages, when the 
Analyzer needs to recognize a recursively de- 
fined Syntactic Type, it wants the longest string 
which is a member of that Type— that is, it 
should keep re-entering the iterated Component 
of the Definition (in our latest transformed 
form) until it meets a failure. The Syntax 
Tables and Analyzer described in this paper 
will find the “longest” instance of a type but 
this Analyzer does report each partial recogni- 
tion also. 

Now, a slight change in the Analyzer algo- 
rithm allows it to avoid reporting partial recog- 
nitions to the Generator, and call it only when 
it has completed recognition of the longest 
instance of a Syntactic Typ^. For those who 
mighCbe interested in exploring this point, the 
changes to be made are : 

1) Eliminate the boxes (GENERATE) and 
(struct [SOURCE] ") ? from the flow chart. 

2) Insert a (GENERATE? box between the boxes 
[Popup GOAL. SOURCE! and (SOURCE=0?) 

The entire Syntax Structure Table entry 
STRUCT can also be eliminated. 

In order to correctly record the recognitions, 
the Generator must construct a slightly differ- 
ent tree (we are here assuming operation in 
the “deferred generation” mode), the form of 
which is best illustrated by an example: 

For the Input (sub-) string, 

A+B+C+D 

the Generator discussed in this appear will pro- 
duce a (sub-) tree: 
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<arith expr> 


<arith expr> -f <lenn> 

\ 

<arith expr> + <term> 



D 


<arith expr> -f <term> 


<terin> 


B 


If the modifications mentioned above are made 
in the Analyzer, each of the terms in the input 
string will cause the Analyzer to signal recog- 
nition of a <term>, but only one recognition 


of <arith expr> will be reported, and that 
after all four <terms>. The Generator will 
then tree this recognition as follows: 



A B 

To use a tree of this form for the generation of 
output code, the Generator language must be 
extended, for example, to refer to “siblings” 
instead of just “sons” and “fathers” and also 
to “count” sons (the number per node is not 
fixed, but depends upon the actual input) or to 
recognize the non-existence of a “sibling,” etc. 
Typically, the processing sequence for the above 
tree is for a <term> node first to send control 
to its first son, to evaluate the sub-tree, then 
when control returns, to send control “hori- 
zontally” — to its siblings — for evaluation of the 
other <term>s; only the last <term> node 
would send control back to the father. Refer- 
ences [1] and especially, [2], discuss this in 
more detail. 

A More Subtle Problem 

The Analyzer algorithm given in this paper 
has the following property: Assume that a 
given Syntactic Type <x> has been recognized 
at character-position c in the input string, and 
the Analyzer fails to recognize any of the pos- 
sible successors of <x> within the Definition of 


C D 

its current goal. The Analyzer will report fail- 
ure for this Alternative of the Definition. It is 
possible to formulate a Syntax Specification in 
such a way that this reported failure would be 
erroneous : if there were another, different, sub- 
string starting at c and which was also an <x> 
(this other substring would correspond to an 
Alternative occurring later in the Definition of 
<x> than the Alternative which was first suc- 
cessfully recognized) . It is certainly possible to 
design an Analyzer which will keep track of the 
information necessary to allow this kind of 
“back-up” (see References [5] and [13]), but 
for the present purposes it would have en- 
cumbered the description with a good deal of 
additional mechanism — essentially, the Syntax 
Structure Table would have another item, en- 
coding the converse of the relation represented 
by SUCCESSOR, and push-down storage would 
be required to keep track of the SOURCE lines 
of the Types successfully recognized, instead of 
just those which are currently being worked on. 
Using the Analyzer and Generator described in 
this appendix, it becomes much easier to ac- 
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commodate this feature, since the required 
additional information can easily be kept in the 
tree while it is being built. Reference [2] dis- 
cusses this question in detail and with examples. 

“Top Down” vs. “Bottom Up” 

The Analyzer described in this paper is of 
the sort known as “top down,” the appellation 
referring to the order in which the Analyzer 
sets its goals. The present Analyzer will always 
set as its next goal some Syntactic Type which 
appears as a Component in the Definition of 
its present goal — any time the Recognizer is 
called to find a Terminal Type, the pushdown 
storage of the Analyzer will contain a record 
for each Syntactic Type in a chain reaching 
down from the Starting Type to the Terminal 
Type. The order in which a “bottom up” Ana- 
lyzer sets its goals is much more difficult to 
describe, but the actions of the two types can 
be impressionistically sketched as follows : 

The “top down” Analyzer sets a goal and 
tries all possible ways of achieving that goal 
before giving up and replacing the goal with an 
alternative. 

The “bottom up” Analyzer, having recog- 
nizBci a Syntactic Type, checks whether it has 
“gone astray” in trying to reach its goal or 
whether that Type is indeed a possible first 
Component of a first Component of ... of the 
goal. If the latter, it continues processing input 
until it has built another Type of which the 
previous one is a first Component, and goes back 
to the checking. If it has gone astray, it backs 
“down” and tries to see if it can construe the 
input differently, to approach its goal along a 
different chain of intermediate types. 

E. T. Irons’ original syntax-directed Ana- 
lyzer design was of this type (Reference [4] ) . 
It might be interesting to characterize an Ana- 
lyzer similar to Irons’ within the terminology 
of this paper. 

We start with the Basic Syntax Specification, 
and first build a magical matrix which will 
answer the question “Can a start with 
where a and y8 are Syntactic Types. The relation 
“can start with” is defined recursively as fol- 
lows: 


A (Defined) Syntactic Type a can start with 
the Syntactic Type /8 either 

1) if j8 appears as the first Component of 
some Definiens of a, or 

2) if there exists a y which can start with p, 
and y occurs as the first Component of some 
Definiens of a. Irons’ paper [4] gives an ele- 
gant technique for constructing this matrix. 
Note that a can start with oc, if it is lef Re- 
cursively defined, but not otherwise. 

The next step is to transform the Basic Speci- 
fication : 

First, remove the Defined Type and the sign 
‘ from the left-hand end of the Definition, 
and place ‘=::’ followed by the Defined Type 
at the right-hand end. (In effect, when a defini- 
tion is considered from left to right, the Type 
which it defines is not known until all the Com- 
ponents have been recognized.) Hereafter, the 
Defined Type will be called the “Result” of the 
Definition. For example, the Definition of 
<assignment> becomes: 

<variable>=<arith expr> = : : 
<assignment> 

Second, apply the left-distributive law to the 
set of definitions, introducing Alternation signs 
‘I’ as required. To illustrate, the following two 
interesting lines would result, in the example 
language of this paper : 

<term> [*<factor> =::<term> | 

= : : <arith expr>] 

<variable> [= <arith expr> = :: 
<assignment> | =::<factor>] 

The effect of this transformation is to reduce 
the set of definitions to one line for each Syn- 
tactic Type which occurs as a first Component 
of one of the original Simple Definitions. 

From the resulting set of “definitions” syntax 
tables are constructed, analogous to the ones in 
this paper. But the analogue of the Snytax 
Type Table is now a directory of first Compo- 
nents, each entry of which points to the first of 
a block of structure-table entries which encode 
the remainder of the “definition,” now includ- 
ing a mention of the Result of the original 
Simple Definition (suitably flagged to avoid 
interpreting it as just another successor) . 
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For use with a '‘bottom up'' Analyzer of this 
sort, the Terminal Types of the language (or 
at least those which appear as first Components 
in any Definition) must be unambiguously rec- 
ognizable independently of context — that is, the 
Recognizer may be told merely to “find some- 
thing," and it will return with an indication of 
the particular Terminal Type it recognized. 

To start the analysis, the goal is set to the 
Starting Type, and the Analyzer proceeds as 
follows : 

Step 1 Call the Recognizer; the Terminal 
Type it reports is placed in Type In Hand. 

Step 2 Can the goal start with the Type In 
Hand? If not, go to Step 4. If so, proceed to 
Step 3. 

Step 3 Consult the Structure Table at the 
point indicated in the Type Table for the Type 
In Hand. Push down the current goal and its 
source and set up as new goal the Component 
mentioned in this entry in the Structure Table. 
(This structure-table entry is the “source of 
this goal") . Go to Step 1. 

Step h Is- the Type In Hand the same as the 
goal? If not, go to Step 7. If so, proceed to 
Step 5. 

Step 5 (We have attained a goal) Consider 
the source of this goal. Is the successor of that 
entry in the Structure Table fiagged as a Re- 
sult? If so, go to Step 6. If not, replace the 
goal with the (Syntactic Type mentioned in 
the) successor of the source, reset the source to 
point to this successor, and go to Step 3. 

Step 6 (We have recognized all the Compo- 
nents of a Definition.) Place the name of the 
Type mentioned in the Result entry into Type 
In Hand, pop up the goal (and source), and go 
to Step 2. 

Step 7 (We have “gone astray.") Consider 
the Structure Table entry for the source of the 
current goal. Does it have an alternate? If not, 
go to Step 8. If so, restore the input-string 
pointer to the value it had when the current 
goal was first set up, replace the current goal 
with the alternate (adjust source), and go to 
Step 1. 


Step 8 Pop up the goal and source, and go 
to Step 4. 

For programming languages of the current 
sort, there is no clear advantage in favor of 
either the top down or bottom up analysis tech- 
niques, insofar as efficiency of the Analyzer is 
concerned. For either technique, it is possible 
to design a language and Syntax Specification 
on which the technique will perform very 
poorly, while the other one will not be nearly 
as bad. The choice between the techniques is 
generally made on the basis of considerations 
other than raw speed of the analysis, such as 
the kind of output desired from the analysis, 
the possibility of error detection and correction, 
or personal taste. 

Bootstrapping'' 

As a final comment, we merely point out the 
fact that the language of the Syntax Specifica- 
tion is itself a rather straightforward, well- 
behaved language, easily susceptible of being 
described by a Syntax Specification. A version 
of the compiler can be written which uses a 
Specification of the Syntax-Specification-Lan- 
guage to “drive" it, and produces, instead of 
output code in a machine language, a set of 
Syntax Tables which encode the Syntax Specifi- 
cation it receives as input. This has, in fact, 
been done (References [1], [2]). 
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INTRODUCTION 

If a compiler is to generate efficient object 
code, there are several different kinds of opti- 
mization which should take place. Each of these 
optimization procedures has a preferred do- 
main : that is, some algorithms prefer to oper- 
ate over the input string, others over the tree 
which describes the syntax of the string, others 
over the “macro-instructions” which are gen- 
erated from the tree, and so forth. In an earlier 
paper,i one of the present authors pointed out 
the necessity for employing the tree form in 
particular as a natural domain for optimizers 
which consider sjmtactic context and suggested 
that, just as Irons^ and others had built gen- 
eral-purpose table-driven parsing algorithms, 
one could also build a general-purpose table- 
driven program for getting from trees to 
macro-instructions. The final compiler design 
presented here is the result of pursuing that 
kind of thinking somewhat farther. 

COMPILER ORGANIZATION 

The compiler is composed of five phases (not 
“passes,” if that means pulls of the input tape) : 

1. A ssmtactic analyzer which converts a 
piece of input string into a tree-representation 
of its syntax. 

2. A generator, which transforms the tree 
into a sequence of n-address macro-instructions, 
investigating syntactic context to decide the 
emission. 


3. An “in-sequence optimizer^’ (ISO) which 
accumulates macros, recognizes and eliminates 
the redundant computation of common subex- 
pressions, moves invariant computations out of 
loops, and assigns quantities to special regis- 
ters. 

4. A code selector which transforms macros 
into syllables of machine code, keeping com- 
plete track of what is in special registers at 
each stage of the computation. 

5. An assembler which simply glues together 
the code syllables in whatever form is required 
by the system with which the compiler is to 
live : symbolic, absolute, or relocatable, with or 
without symbol tables, etc. 

The first four phases are encoded as general- 
purpose programs; the fifth has been handled 
as a special-purpose job in each version of the 
compiler, and will therefore not be covered in 
the present discussion. 

Phase I: Analyzer 

The analyzer is of the “top-down” syntax- 
directed variety, driven by tables which are in 
effect an encodement of the source language’s 
syntax as given by a description in the meta- 
linguistics of the ALGOL 60 report® (the so- 
called “Backus normal form”). There are sev- 
eral features of interest in this encodement: 
rules are shortened where possible by applica- 
tion of the distributive law (thus, “<A> 
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<B> I <A> <C>” would be coded as '‘<A> 
(<B> I <C>)''; it is possible to define types 
by naming an arbitrary scanner which recog- 
nizes them, thus eliminating syntax-chasing 
when forming ordinary identifiers, etc.; left 
and right recursive rules are carried in a trans- 
formed condition whose effect is to force recog- 
nition of the longest possible string which satis- 
fies the rule. The analyzer tables contain some 
information which is not syntactic, but rather 
concerned with compiler control (how much 
tree should be built before the generator is 
called, for example) or with specifying addi- 
tional information to be placed in the tree for 
later use by the generator. 

Phase II: Generator 

The generator algorithm ‘‘walks'" through the 
tree from node to node, following directions 
carried in its tables. As it walks, macro-in- 
structions are emitted from time to time, also 
as indicated in the tables. The encodement of 
a set of tables (a so-called “generation strat- 
egy") is based upon the idea of a “relative tree 
name." A relative tree name is, formally, a 
function whose domain is the set of nodes in 
the tree and whose range is its domain union 
zero. At any given time, the generator is look- 
ing at (has walked to) some particular node 
of the tree. Every relative tree name is then in- 
terpreted (evaluated) as a function of that 
node as independent variable. A relative tree 
name is a rule for getting from “here" to some 
neighboring node of the tree. Thus, if we may 
view the tree as a genealogical one, a relative 
tree name might “mean" father or first son or 
first son of second son, for example, of the node 
the generator is currently considering. 

A generation strategy is composed of a set 
of rules each of which consists of a description 
of some kind of node which is somehow of 
interest together with a list of things to be 
done when a node of that kind is in fact encoun- 
tered. A kind of node is generally distinguished 
by its own syntactic type and the types of some 
of its neighbors. The things to be done include 
walking to a neighboring node and emission of 
macros whose variables are neighboring nodes. 
In all cases, neighboring nodes are named in 
the tables by relative tree names. 


Phase III: In-Sequence Optimizer 

The ISO accepts macro-instructions emitted 
by the generator. The processing of a macro 
usually results in placing the macro in a table 
and sending a “result" message back to the 
generator. Macros also instigate various book- 
keeping and control operations within the ISO. 

The processing of a macro is controlled by a 
table of macro descriptions. A macro may or 
may not be capable of being combined with 
others into a common-subexpression ; the argu- 
ments of a macro may or may not be commuta- 
ble, and so forth. The ISO will detect macros 
whose arguments are literals and in effect exe- 
cute those macros at compile time, creating 
new literals. If a macro may be handled as 
(part of) a common subexpression and is not 
computable at compile time, the ISO will recog- 
nige a previous occurrence of the same macro 
as equivalent if none of its arguments have 
been changed in value either explicitly or im- 
plicitly by any of the messages that have been 
received in the interval. 

At some point the ISO receives a macro- 
instruction that signals an “end region" con- 
trol operation. This causes the ISO to perform 
a set of “global" optimizations over the region 
of macros just completed. These global optimi- 
zations include the recognition of those com- 
putations which remain “invariant" within the 
region and the reservation of “special regis- 
ters" such as index registers to reduce the num- 
ber of special register loads and stores within 
the region. 

Phase IV: Code Selector 

The code selector produces symbolic machine 
code for a region of macros after the ISO has 
collected these macros and performed its vari- 
ous optimizations. The code selector is driven 
by a table of code selection strategy. The do- 
main of a strategy is the region of macros and 
a “track table" which represents the condition 
of the registers of the target computer. 

The code selector views the macros as nodes 
of a tree structure; that is,. certain partial or- 
derings exist which guarantee that the code 
emitted preserve the computational meaning of 
the macros, but within these constraints the 
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strategy can order the computation according 
to its convenience, as a function of the avail- 
ability of registers and results. The preferred 
mode of operation is to postpone making deci- 
sions about what code to generate for a macro 
until it is known how the result of that macro 
will be used. 

The code selector also makes use of certain 
information gleaned from the macro-descrip- 
tion tables in order to predict how the special 
registers (index registers, etc.) will be used. 
These predictions enable the code selector to use 
such registers intelligently. This local optimi- 
zation, combined with the global reservation 
of special registers by the ISO, results in a 
fairly effective use of these registers. 

BOOTSTRAP TECHNIQUE 

There are three major sets of tables to be 
prepared if the compiler is to be particularized 
to a specific source language and target ma- 
chine. These are the syntax tables, the genera- 
tion strategy tables, and the tables of macro 
description and code selection. The bootstrap 
technique is simply a method of automating 
part of the process of preparing these tables. 

A group of three languages was developed, 
corresponding to the three tables. These 
languages are all describable in the Backus no- 
tation and thus capable of analysis by the ana- 


lyzer. A set of syntax tables and generation 
strategy tables for these languages was encoded 
by hand and installed in the compiler, which 
was then capable of translating from these spe- 
cial languages into macro-instructions. The 
link from the generator to the ISO was broken 
and replaced by a link to a special “Bootstrap 
ISO” which converted these apparent macro- 
instructions into lines of tables, namely, the 
three tables of interest. Thus the process of 
table preparation was reduced to one of writing 
down statements in a family of readable lan- 
guages. 

THE LANGUAGE BNF 

The syntax of the source language is de- 
scribed in a language called BNF (to suggest 
“Backus normal form,” denoted B.n.f.). BNF 
looks much like B.n.f. to within the limitations 
of the available character set and a desire for 
legibility. Thus syntactic type names are given 
as identifiers (upper case alphanumerics), the 
sign “ : :=” is replaced by “=”, and the sign 
“I” by “/”; literal symbol strings are headed by 
“$” and delimited by “/” or blank. Within a 
symbol string “$” acts as a control character 
and forces inclusion of the next character as 
part of the string. Thus, as long as the symbol 
strings do not include “$”, “/”, or blank, every- 
thing is quite readable; if they do, legibility 
drops accordingly. 


Examples : 

B.n.f. 

<arex> : : = <term> | <arex> <adop> <term> 
<relop> : : = GE | LE | UE | EQ 
<mulop> :: = *\/ 


BNF 

AREX = TERM/AREX ADOP TERM 
RELOP = $GE/$LE/$UE/$EQ 
MULOP = $*/$$/ 


To each BNF type definition may also be appended tags which provide control information 
associated with the type. 


THE LANGUAGE GSL 

A statement in the generation strategy language GSL begins with a predicate to be satisfied, 
of the form : 

IF <typename> AND (ti) AND . . .AND (t„). 


where the are assertions whose truth is to be 
tested and the L are relative tree names. The 
are assertions about the presence or absence 
of a node, its S3mtactic type, the number of its 
“sons” (components), and so on. 


Following this predicate is a sequence of 
commands to be associated with nodes of the 
distinguished kind. A command is either a 
directive to proceed to another node or an action 
of some sort (emit output, for example). 
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Example: 

IF AREX AND SONS = 0 AND 
FATHER*FATHER*RTSIB IS ADOP, 
$LFTSIB $SONl $OUTPUT 
(PLUS,SONl,FATHER*RTSIB) . 

The list of commands is in effect a small pro- 
gram to be attached to a node when its kind is 
first determined. It is an odd program, in that 
each node keeps its own program counter. The 
generator may consider a node several times. 
The first time, its kind is determined, its gen- 
eration strategy “program” attached, and the 
first command of the program executed. Upon 
subsequent reconsiderations of that particular 
node, the first command not yet executed is in 
fact performed. There are appropriate pro- 
cedures for handling ill-defined commands (if 
a command directs walk to a non-existent node, 
take the next command; if there are no more 
commands, go to FATHER, and so on). 

In the above example, upon first entry, the 
generator will just move consideration to the 
“left sibling” of the current node; at second 
entry, to the first son ; at third, a PLUS macro 
will be emitted ; afterward, any entry will cause 
automatic transfer to the father of this node. 


This exception is readily detectable by the 
BNF/GSL programmer, and thus does not vio- 
late the general philosophy. 

The language GSL includes a number of com- 
mands besides “OUTPUT.” These are con- 
cerned with passing information from node to 
node of the tree and with the emission of con- 
trol messages to the ISO. 

THE LANGUAGE MDL 

The Macro Description Language provides a 
means of completely defining the semantics of 
the macro-instructions emitted by the genera- 
tor. The definition contains the information 
necessary for controlling the processing of a 
macro by the ISO and the code selector, in- 
cluding a definition of the instruction reper- 
toire and special registers of the “object” com- 
puter. 

The most significant part of MDL is the sub- 
language CSL (Code Selection Language). CSL 
permits the writer of a strategy to describe deci- 
sion rules for the selection of machine instruc- 
tions in terms of the status of macros (and 
their dependents : arguments, arguments of 
arguments, etc.) and the status of computer 
registers. 


Obviously, for a language of this sort — ^built 
around the relative tree name idea — ^to work at 
all, it is essential that the GSL programmer 
know in precise detail what tree structure will 
be output by the Analyzer. Thus the compo- 
nents of a type as given in the BNF and the 
sons of a node are in perfect one-one corre- 
spondence. The single exception is that left 
and right recursives come out as a “bush” of 
components at the same level ; thus, 

AREX = TERM/ AREX ADOP TERM 

would produce, from “A-fB-j-C”, a bush of the 
form 



To illustrate a few of the properties of CSL 
and the code selector, assume the compiler is 
translating an algorithmic language with sim- 
ple arithmetic assignment statements and has 
analyzed and generated over the statement 

A B * C + A 
producing the following macros : 

(1) B, C 

(2) +, (1),A 

(3) A, (2) 

Assume that the computer has an accumu- 
lator (ACC) , an addressable memory (M) and 
five single-address instructions : 

CLA M : (M) ACC 

ADD M : (ACC) + (M) ^ ACC 

MPY M : (ACC) * (M) ^ ACC 

RAD M : (ACC) -|- (M) ACC, M 

STO M : (ACC) M 

Following the principle that no code should 
be generated for a macro until the intended use 
of the result of the macro is known, the code 
selector would begin execution of the selection 
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strategy for macro line (3), the strategy for a 
macro. 

For this example the strategy would recog- 
nize that the RAD instruction was applicable 
and, after deciding on the execution of macro 
line one and verifying that its result is in the 
ACC, would emit an RAD A instruction. 

The strategy for the on macro line one 
would produce CLA B followed by MPY C. 

Hence the code produced would be 
CLA B 
MPY C 
RAD A 


To give some idea of the appearance of a 
code selection strategy we append a set of state- 
ments in CSL. Each line has been numbered to 
facilitate referencing a set of explanatory 
notes. Also refer to the descriptions of EXE- 
CUTE, OUTPUT, and ALLOCATE which ap- 
pear immediately after the strategies. 

In the notes, the following symbols are 
used: 

Q/ variable 
£ expression 

ft) operation (either or “x”) 


FOR 

1 . 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 

9. 

10. 

11 . 

12 . 

13. 

14. 


IS ARG2 = “+" MACRO 
begin IS ARGl (ARG2) = ARGl 
begin Cl = ARG2(ARG2) 

GO TO ALPHA 
IS ARG2 (ARG2) = ARGl 
begin Cl = ARGl (ARG2) 

ALPHA.. EXECUT* (Cl) 
ALLOCATE* (Cl TO ACC) 
OUTPUT* ((RAD) ARGl) 
EXIT 

EXECUT (ARG2) 

ALLOCATE (ARG2 TO ACC) 
OUTPUT ( (STO) ARGl) 

EXIT 


end 


end end 


Notes for 

1. Is the second argument (i.e., the value be- 
ing assigned to the first argument) the 
result of a macro? 

2. If so, is the first argument of the 
macro identical to the variable receiving 
the assignment ; in other words do we have 
the form Q/ *- Q/ + 

3. If so, the local variable Cl is set to the 
macro line number for 

4. And trjCnsfer control to ALPHA (line 7). 

5. Alternatively, is the second argument of 

the “-f” macro identical to the variable 
receiving the assignment; in other words 
do we have the case 0/ ^ ^ 

6. If so, the local variable Cl is set to the 
macro line number for £. 

7. ALPHA . . Execute the code selection 
strategy appropriate for the macro on the 


line specified by Cl (i.e., cause tl» evalu- 
ation of £. 

8. Execute the code selection strategy sub- 
routine ALLOCATE to guarantee that the 
result of macro line Cl (i.e., the value of 
£} is in the accumulator. 

9. Output to the assembler RAD (}/. 

10. Exit. 

11. Otherwise (the fail path of 1. or 5.) exe- 
cute the code selection strategy appropri- 
ate for the macro pointed to by the second 
argument. (We hawe liie case Q/ *- £ and 
wish to cause the generation of code to 
evaluate £). 

12. Execute the code selection strategy sub- 
routine ALLOCATE to guarantee that the 
value of £ is in the ^accmni^tm*. 

13. Output to the assembler STO Qf. 

14. Exit. 
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FOR “+” 

1. C2 = (ADD) 

2. BETA . . EXECUTE (ARGl) 

3. EXECUTE (ARG2) 

4. ISARG2INACC 

5. begin Cl = ARGl 

6. GO TO GAMMA end 

7. ALLOCATE (ARGl TO ACC) 

8. Cl = ARG2 

9. GAMMA . . OUTPUT ( (C2) Cl) 

10. EXIT 

Notes for 

1. The local variable C2 is set to the operation 
code for “ADD.” 

2. BETA . . Execute the code selection strat- 
egy appropriate for the macro pointed to 
by the first argument (we have the case 
£i " and wish to cause the generation 
of code to evaluate ) . 

3. Execute the code selection strategy appro- 
priate for the macro pointed to by the sec- 
ond argument (i.e., £- 2 ). 

4. Is the value of £■> now in the accumulator ? 

5. If so, set local variable Cl to £ 1 . 

6. And transfer control to GAMMA (line 9). 

7. Otherwise execute the code selection 
strategy subroutine ALLOCATE to guar- 
antee that £i is in the accumulator. 

8. And set local variable Cl to £,. 

9. GAMMA . . Output to the assembler the 
operation code specified by C2 and the ad- 
dress specified by Cl. 

10. Exit. 

For 

1. C2 = (MPY) 

2. GO TO BETA 

Notes for 

1. The local variable C2 is set to the operation 
code for “MPY”. 

2. And transfer control to BTA (line 2 for 
“-1-” strategy). 

EXECUTE (N) : If N names a macro line this 

action causes the code selection to execute the 


strategy appropriate for the macro on line N 
and then resume where it left off. 


OUTPUT ( ) : This action outputs code to the 

assembler. 



ALLOCATE (N, 

TO No) : 

Ni names a macro 

line and No names a register (or register class) . 
This code selection subroutine causes the code 
selection to guarantee that the value for which 
Ni stands is placed in the register (or register 
class) named by No, saving the contents of the 
register if they are still needed. 

A flow history 

of the 

processing for the 

example would be 

; 


macro type 

line 

resulting output 

<r- 

1 


<- 

2 


<- 

5 


<- 

6 



7 


* 

1 


4: 

2 


+ 

2 


+ 

3 


+ 

4 


+ 

7 

CLA B 

+ 

8 


+ 

9 

MPY C 

+ 

10 


<- 

8 


4 - 

9 

RAD A 

4 - 

10 



STATUS AND EVALUATION 

This compiler and its associated bootstrap 
have been realized on a variety of machines 
(IBM 7090, Burroughs D-825, CDC 1604). The 
compiler has been used to translate from source 
languages JOVIAL, Lo (the algebraic language 
of the CL-I System), and CXA (a BALGOL 
dialect) into several machine languages, includ- 
ing D-825 and 1604. The method of moving the 
compiler from machine to machine may be of 
interest as an indication of the power of the 
technique. 

The compiler itself was originally written in 
language Lo and compiled through the CL-I 
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Programming System into a running program 
on the IBM 7090. Then a deck of BNF/GSL/ 
MDL cards defining the translation from L,, into 
GDC 1604 was input to the bootstrap. After exe- 
cution of the latter program, there existed a 
compiler for translating algorithms written in 
Lk> into an assembly language (meeting the re- 
quirements of the COOP system on the GDC 
1604), operating on the IBM 7090. That com- 
piler was fed the decks of cards in Lo which had 
originally defined the compiler to CL-I. The 
result of this run was a compiler (and boot- 
strap) which could be moved to the GDC 1604. 
Then that version of the bootstrap was fed 
decks in BNF/GSL/MDL which defined the 
translation of CXA into GDC 1604, resulting in 
a CXA compiler on the GDC 1604. 

The compiler is not as fast as some we have 
seen, but does not seem prohibitively slow, 
either. The object code is quite alarmingly 
good; indeed, it is frequently completely un- 
readable by a programmer of normal patience. 
In practice we prepare a BNF/GSL/MDL deck 
which does an adequate job. If we later want 
to improve the code (and are willing to slow 
down the compiler accordingly), we simply 
extend the deck. 


The bootstrap method does not make com- 
piler construction trivial, since code selection 
for a messy machine can be very difficult to 
work out and since contact with the data and 
control environment of the code being compiled 
may be more expensive than the translation 
process. What is now trival is the substantial 
modification of source syntax, minor changes 
in optimization rules, and the like. 

The work described here has been informally 
reported to several agencies over the last year 
under various names, including “G.G.S.” (for 
“compiler generator system”), the “bootstrap 
method,” and the “GOMPASS technique” for 
Gomputer Associates, Inc.). 
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A COMPUTER TECHNIQUE FOR PRODUCING 

ANIMATED MOVIES 


Kenneth C, Knowlton 
Bell Telephone Laboratories, Incorporated 
Murray Hill, New Jersey 


INTRODUCTION 

This paper describes a computer technique 
used for the production of animated diagram 
movies.* This technique — ^as implemented with 
the IBM 7090 computer and the Stromberg- 
Carlson 4020 microfilm recorder^ — involves the 
basic steps of coding and checkout, production 
computer run, and optical printing from the 
master film thus produced. 

Programs are coded in the ‘‘movie language'' 
to be described, a language which has been de- 
veloped entirely within the framework of 
MACE^ FAP.2 They are checked out, without 
producing film, through examination of picture 
samples printed on the standard output printer. 

After checkout, the production run produces 
a magnetic tape which instructs the 4020 in 
exposing a master film, such as that shown in 
Fig. 1. Each frame of this film is made of a 
rectangular array of tiny characters produced 
by the 4020 charactron tube used in the type- 
writer mode. The recording camera is slightly 
defocussed, thereby turning the finely struc- 
tured characters into contiguous blobs of differ- 
ent intensities, depending upon the characters 

*This computer technique is also described in a 17- 
minute 16mm black and white silent movie which was 
produced by the very process which it describes. This 
movie, entitled “A Computer Technique for the Pro- 
duction of Animated Movies”, is available on loan from 
the Technical Information Libraries, Bell Telephone 
Laboratories, Incorporated, Murray Hill, New Jersey. 


used. Each picture thus consists of a rectan- 
gular array of blobs on a raster either 126 wide 
and 92 high or, for finer resolution, 252 wide by 
184 high. Figures 2a, 2b, and 3 of this paper 
were actually made by this system — operating 
in the fine-resolution mode — as if they were to 
appear as scenes in a movie. 

The master film contains only one picture for 
each sequence of identical frames of the final 
movie; a comment above this picture indicates 
the length of the sequence. The “stretching 
out" of the master is done by optical printing 
at a movie laboratory, where standard proc- 
esses are also used for editing, adding a sound 
track, and making work prints and final prints. 

Internxil Representation of Pictures 

The movie programmer imagines that pic- 
tures exist within the 7090 on rectangular sur- 
faces ruled off in squares, each square contain- 
ing a number from 0 to 7. Pictures are created 
and manipulated by changing the patterns of 
numbers in the squares. During output these 
patterns of numbers are interpreted as spots of 
appropriate shades of grey, according to a pro- 
grammer-specified transliteration. Fine-reso- 
lution pictures are produced by “aiming" the 
output routine at a subarea 252 squares wide 
and 184 squares high ; for coarse-resolution pic- 
tures the output routine is aimed at an area 126 
squares wide and 92 squares high. 
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Figure 1. The beginning of a master film, shown here 
larger than actual size. The comment above each frame 
gives the date, the beginning and ending frame num- 
bers of the corresponding sequence in the final film, 
and (redundantly) the length of this sequence. The 
final movie in this case begins with the framing and 
focusing pattern, followed by two seconds (48 frames) 
of black. Then the four title scenes appear, in order, 
followed by one second of black. 


The total storage area within the 7090 corre- 
sponds to two complete fine-resolution movie 
frames. This area may be used in different 
ways: as two independent surfaces, each just 
large enough for one complete movie frame, or 
as one surface twice as wide or one surface 
twice as high. These possibilities are indicated 
in Fig. 2a which gives the names of these sur- 
faces and their sizes in squares. Every square 
of a surface is assigned x and y coordinates, 
the bottom left-hand corner square of every sur- 
face having the coordinates x = 0, y = 0. 

There are still other sizes and shapes of sur- 
faces which can be used for complete coarse- 


resolution frames (or for parts from which 
fine-resolution pictures will be composed by 
copying) . These surfaces, with their names and 



Figure 2a. Names and sizes of surfaces large enough 
to hold complete fine-resolution movie frames (252 x 
184). Alternate uses of total 7090 storage area are 
indicated, including the uses of region AA in ways 
shown in Fig. 2b. 



Figure 2b. Names and sizes of additional surfaces, each 
of which is at least large enough to hold a complete 
coarse-resolution movie frame. 



Figure 3. Representation of a surface with scanner A 
at X == 3, y = 5, and scanner B at x = 1, and y = 2. 
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sizes, are shown in Fig. 2b. Their use precludes 
the simultaneous use of surfaces AA, VV, or CC. 

The Programming Language 

The programmer's basic conceptual frame- 
work includes a number of scanners which he 
imagines to be sitting on various squares of the 
surfaces (see Fig. 3). Scanners can intercom- 
municate, and each can read the number it is 
sitting on, can write a new number into this 
square, and can move right, left, up, or down 
any integral number of positions to a new 
square. Scanners may also convey information 
— by virtue of their positions or the numbers 
on which they sit — ^to the subroutines of the 


movie system. There are 26 scanners in all, each 
named by a different letter of the alphabet. 

The language by which the computer is pro- 
grammed to make movies may be divided into 
two parts. The first and historically older part 
consists of those instructions for drawing and 
changing pictures by explicit manipulation of 
scanners. This part, the ‘‘scanner language," 
may be used for other purposes than movie 
making. It may be used, for example, to draw 
flowcharts or schematic wiring diagrams which 
are represented within the computer by two- 
dimensional arrays of the digits 0 through 7. 

An example of an instruction in the scanner 
language is 


IFANY (B,R,10) (B,A,C) (A,E,7)T(A,T,B) (A,U,2,) (A,W,3) L0C5 


which says that if any of the following is true 

that scanner B is J?ight of x — iO, or 
that scanner B is Above (in a line higher 
than) scanner C, or 

that scanner A is sitting on a number J^qual 
to 7 

Then the following operations are performed 

scanner A moves To the same surface and the 
same square as scanner J?, 
scanner A moves C7p 2 squares, and 
scanner A TFrites the number S, and 

control then goes to the line of coding labeled 
LOC5. If none of the three elementary condi- 
tions is satisfied, no operations are performed 
and control goes to the next line of coding. 

The scanner language permits a large num- 
ber of different elementary tests on the posi- 
tions of scanners and on the numbers on which 
they sit. There are also a large number of ele- 
mentary operations for moving scanners and 
for changing the numbers on which they sit. 
A complete list of these basic tests and opera- 
tions, and the flexible formats in which they 
may be used, is given in Appendix A. A con- 
cise summary of the scanner language appears 
there in Tables A.l and A.2. 

The other part of the movie language may be 
called the “movie language proper." It consists 
of more powerful instructions which in general 
compile into calls to subroutines written pri- 
marily in the scanner language. (These sub- 


routines actually use an additional set of 26 
scanners which the programmer need not know 
about.) Instructions of the movie language 
proper fall logically into three categories: in- 
structions for controlling the output or tempo- 
rary storage of pictures, instructions for per- 
forming drafting and typing operations, and 
instructions for modifying the contents of rec- 
tangular areas. These categories will be dis- 
cussed briefly in turn. 

The outputting of movie frames (i.e., the 
writing of the tape which will control the S-C 
4020) is performed by an internal “camera" 
subroutine. The subroutine has many modes of 
o^Jeration, which are determined by “camera 
settings." One setting “aims" the camera at all 
of or part of a surface; ano'ther specifies how 
many frames of the final film are to be produced 
for every frame of the master film. A third 
setting specifies the transliteration from digits 
0 through 7 to the typeable characters of the 
4020 charactron. Other settings specify what 
output the camera routine should produce on 
the normal printer, for monitoring purposes. 
Facilities are also available for temporarily 
storing entire contents of surfaces on a disc 
file. Up to 440 complete fine-resolution movie 
frames may be stored on the disc at any one 
time. 

The facilities for drafting include the ability 
to draw straight lines, arcs, and arbitrary 
curves. These lines may be made to appear 
instantaneously in the movie, or they may be 
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made to appear gradually as they are drawn. 
Lettering may be done by “typing'' letters of 
six different sizes, the smallest letters each cov- 
ering an area 4 squares wide and 5 high, the 
largest covering an area 15 squares wide and 
21 high. 

Finally, the entire contents of a surface or a 
rectangular subarea may be changed in many 
ways. The area may be “painted" by filling all 
of its squares with one particular digit, or 
another area of similar size and shape may be 
copied into it. The contents of the area may be 
shifted an integral number of positions up, 
down, right, or left, or they may be “expanded" 
(by repeating rows or columns) or “squashed" 
(by deleting rows or columns) in any of the 
four directions. Certain local operations may 
be performed throughout the area, such as add- 
ing incremental layers to regions defined by a 
certain number, or rounding sharp corners. 
There are operations which approximate the 
effect of a zoom lens by enlarging or reducing 
by an integral factor the pattern of numbers 
within a rectangular area while the camera 
routine is putting out pictures of the intermedi- 
ate states of the picture. There is also a facility 
for “dissolving" one picture onto another by 
sprinkling the new numbers onto the old ones, 
again while the camera routine is outputting 
the intermediate states of the dissolve. 

The movie language proper is described in 
detail in Appendix B. A programming example, 
involving instructions both of the scanner lan- 
guage and of the movie language proper is 
presented in Appendix C. This sample program 
was actually the program used to produce Fig. 
2a of this paper. 

Uses of the Movie System 

This movie language may be used to produce 
many types of simple animated movies. It may 
be used, for example, to produce visual displays 
for psychophysical experiments, or to produce 
a more common type of movie such as the ex- 
pository educational film. 

The system may also be used to convert the 
output of computer-performed experiments into 
visual displays. For example, the person experi- 
menting with heuristics for automatic layout of 
printed circuits may wish to watch in a movie 


the computer's attempts to search efficiently for 
wire paths. 

Costs for producing movies by this means are 
low, and movies can also be produced quickly, 
primarily because only a few people are in- 
volved. Simple educational movies cost a few 
hundred dollars l^r minute, with the cost split 
approximately equally three ways: program- 
mer's time and overhead, computer time, and 
standard movie laboratory operations. 

Much of the power of the movie system as 
implemented derives from the fact that it has 
been constructed entirely within the framework 
of MACR0 FAP. The instructions of the scan- 
ner language and of the movie language proper 
are actually macro-instructions. As such, they 
may be interspersed with instructions of the 
basic FAP language or, more important, with 
higher-order macro-instructions which the pro- 
grammer has defined in terms of the original 
movie instructions. Appendix C contains ex- 
amples of such higher-order macros which were 
developed, in some cases because they were 
more powerful, and in others because they more 
exactly matched the requirements of a specific 
job. 
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APPENDIX A 

The Scanner Language 

The scanner language is that part of the 
movie language by which the programmer ex- 
plicitly performs tests and operations on the 26 
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scanners — ^those reading and writing heads 
illustrated in Fig. 3 which scan and operate on 
two-dimensional arrays of numbers. This ap- 
pendix describes the formats and uses of scan- 
ner language instructions; Appendix B de- 
scribes the more powerful movie instructions 
which constitute the ‘'movie language proper.’' 
An example of programming with both kinds of 
instructions is presented in Appendix C. 

Scanner Initialization 

A scanner, before being used, must be initial- 
ized by an instruction with the special format : 

PLACE sc,surf,x,y 
(e.g. PLACE D, BE, 92, 5) 

which has the effect of placing scanner sc on the 
surface surf at coordinates x,y. The scanner 
may be any one of the 26 available, A,B,. . .,Z, 
and the surface may be any one of those illus- 
trated in Figs. 2a. and 2b. The coordinates 
must refer to a square which is actually on the 
surface: the programmer should note that the 
bottom left square coordinates (0,0) and the 
top right square of, say, a 252 X 184 surface, 
has coordinates (251, 183). 

General Instrnction Formats 

Scanner instructions are generally expressed 
in terms of elementary tests on positions of 
scanners and the numbers in the squares they 
are sitting on, and in terms of elementary 
operations directing scanners to move or write 
new numbers into these squares. 

Instructions — or lines of coding — are of two 
basic types : unconditional and conditional. The 


unconditional instruction may be illustrated 
schematioally as 

Symb THEN ( )( )...( )Symb2 

symbol list of elementary goto 

(Optional) operations (Optional) 

and it contains the following parts ; 

1. In the location field of the card, an op- 
tional FAP symbol which is not a single 
letter or double letter (the symbols A,AA, 
B,BB, . . . Z,ZZ have been pre-^empted by 
the movie system). 

2. In the operation field of the card, the 
macro name THEN. 

3. In the variable field of the card, a list 
of operations, followed by an optional 
single symbol indicating where control is 
to go after the operations have been per- 
formed. If there is no goto, control passes 
to the next line of coding. If there is a 
goto, the entire list of operations may be 
missing. The length of the list is lim- 
ited by the restriction that the operation 
and goto must appear on one card from 
columns 16 through at most column 72. 

The conditional instruction is similar in for- 
mat except that preceding the list of opera- 
tions there appears a list of elementary condi- 
tions followed by the delimiter ‘T’, and that 
the name of the macro-instruction indicates 
which of four logical functions on the condi- 
tions must be satisfied in order that the opera- 
tions be performed: 


Symb ' IFxxx ( )( )...( ) T ( )( )...( ) Symb2 

Symbol list of elementary list of elementary goto 

(Optional) conditions operations 


If the list of operations is null, the ‘T’ is omitted 
and a goto must appear. Again, the length of 
the lists is limited only by the requirement 
that the list of conditions must start in column 
16 and the instruction may extend at most 
through column 72 of the same card. 

Each of the 4 MACR0 names requires satis- 
faction of a different logical function of the 
conditions, as follows : 


IF ANY, satisfied if any of the elementary 
conditions is satisfied. 

IF ALL, satisfied if all of the elementary con- 
ditions are satisfied. 

IFN0NE, satisfied if none of the elementary 
conditions are satisfied. 

IFNALL, satisfied if not all of the elemen- 
tary conditions are satisfied. 
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In each case, if the compound condition is satis- 
fied, then the indicated operations are per- 
formed and if there is a goto, control goes to the 
indicated point in the program, otherwise to the 
next line of coding. If the compound condition 
is not satisfied, no operation is performed and 
control goes to the following line of coding. 

In addition to the five basic macro-instruc- 
tions, the following synonyms are built into the 
system : 


ANY, synonym for IFANY 

ALL, 

<< 

IFALL 

N0NE, 

it 

IFN0NE 

NALL, 

it 

IFNALL 

IF, 

it 

IFALL 

N0T, 

« 

IFN0NE 

EITHER, 

« 

IFANY 

B0TH, 

it 

IFALL 

ELSE, 

it 

THEN 


The first four of these are simply abbreviated 
notations to facilitate programming. The 
others enable the program to be more easily 


read and understood. IF and N)5T are sug- 
gested in place of IFALL and IFN0NE, re- 
spectively, where there is just one elementary 
condition. Likewise, EITHER and B0TH are 
suggested in place of IFANY and IFALL, re- 
spectively, where there are two elementary con- 
ditions. ELSE is suggested in place of THEN 
when it follows a conditional instruction that 
has a goto. 

Elementary Conditions or Tests 

An elementary condition is a simple test per- 
formed on the position of a scanner or on the 
number this scanner is sitting on. Every ele- 
mentary condition is written as a triplet of 
arguments separated by commas and delimited 
by parentheses. It has the form 

(scnr,rel, quant) 

where scnr is a single-letter name of a scanner, 
rel is a single letter designating a particular 
relation, and quant specifies either directly or 
indirectly, the quantity or coordinate involved 


TABLE A.1 


Elementary Scanner Conditions and Their Formats 


a,P 

scanner : A,B,C, . . . Z 

n(a) 

the number a is sitting on 

K (n(a)) 

bit-by-bit complement of n (a) * 

X(a) 

abscissa of a: 0,1,2 . . . 

Y(a) 

ordinate of a : 0,1,2 . . . 

n 

a decimal number (if n is a number in a square, 0 ^ n ^ 7) 


**Quantity'* is a scanner name ''Quantity” is a number 


1. Tests on position of scanner a. Is it true that a is: 


at X O) 

(a,X,^) 

&tX = n 

(a,X,n) 

at Y ip) 

(a,Y,^) 

at Y = 71 

(a,Y,n) 

to the i2ight of X (j3) 

(a,R,/?) 

to the Right of X = ti 

(o,R,n) 

to the Left of X (^) 

(a,L,/3) 

to the Left of X = 

(a,L,n) 

Above Y ip) 

(a,A,)8) 

Above Y — n 

(a,A,n) 

Relow Y (/3) 

(a,B,^) 

Below Y = n 

(a,B,n) 

2. Tests on the number scanner 

a is sitting on. 

Is it true that n(a) 


is Equal to n(/8) 

(a,E,/?) 

is Equal to the number n 

(a,E,n) 

is Not equal to n (j8) 

(a,N,^) 

is Not equal to n 

(a,N,n) 

is Smaller than n (;8) 

(a,S,^) 

is Smaller than n 

(a,S,n) 

is Greater than n (/3) 

(a,G,)8) 

is Greater than n 

(a,G,n) 

contains all Zero bits of n (/3) * 

(a,Z,^) 

contains all Zero bits of n* 

(o,Z,n) 

contains all One-bits of n (y8) * 


contains all One-bits of n* 

(a,0,n) 


With numbers expressed in binary notation. 
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in the test. If qtuint is a number it specifies the 
quantity directly; if quant is a letter than it 
specifies the scanner whose number or position 
is involved in the test. A complete list of tests 
and the letter by which they are designated ap- 
pears in Table A.l. 

Elementary Operations 

An elementary operation, like an elementary 
condition, is written as a triplet of arguments, 
separated by commas and delimited by paren- 
theses, and has the form 

(scnr,op,quant) 


Here senr is the single-letter name of the scan- 
ner which performs the operation, op is a single 
letter designating the operation to be per- 
formed, and quant generally specifies directly 
or indirectly a quantity involved in the opera- 
tion : as in elementary tests, a number specifies 
the quantity directly, whereas a letter specifies 
the scanner whose number or position is to be 
used. One exception is the operation 

(a,Z,^) 

which specifies that both scanners o and 13 are 
to exchange numbers, i.e., each writes the num- 
ber that the other was just sitting on. 


TABLE A.2 

Elementary Scanner Operations and Their Formats 
(Key : same as for Table A.1) 

“Quantity” is a scanner name “Quantity” is a number 

1. Operations for moving a: 


To surface and position of 

ia,T,/3) 



horizontally to Z(j8) 

(a,X,)8) 

horizontally to A = n 

(a,X,n) 

vertically toY(fi) 

(a,Y,^) 

vertically to Y = n 

(a,Y,n) 



Up n squares 

(a,U,n) 



Down n squares 

(a,D,n) 



flight n squares 

(a,R,n) 



Left n squares 

(o,L,n) 

Move one square according to n {^) * 




2. Operations for changing the number a is sitting on, by: 


FFriting the number n (y8) 

(a,W,y8) 

Writing the number n 

(a,W,n) 

Writing K 

(a,K,^) 



exchanging n (a) and n (y8) 

(a,Z,;8) 



bit-by-bit 0Ring by n (jS) 

(a,0,y3) 

bit-by-bit 0Ring n 

(a,0,n) 

bit-by-bit ANDing by n (^8) 

(a,A,y3) 

bit-by-bit ANDing n 

(a,A,n) 

adding n(/8)** 

(a,E,^) 

adding n** 

{o,E,n) 

subtracting n (;8) ** 

(a,F,yS) 

subtracting n** 

(a,F,n) 

multiplying by n (fi) ** 

(a,G,^) 

multiplying by n** 

(a,G,n) 

dividing by n ()8) ** 

(a,J,y3) 

dividing by n** 

(a,J,n) 

Getting a’s memory to n (^) 


Setting a’s memory to n 

(a,S,n) 


* Step up if n(^) = 4, step right if n(;8) = 5, step down if n(/3) = 6, step left if n(/3) = 7, otherwise no motion. 
** Result reduced modulo 8. 


A list of elementary operations appears in 
Table A.2. It should be noted that certain oper- 
ations require that the quantity iftvolved al- 
ways be indicated directly as a number, where- 
as certain other operations require the quantity 
to be specified indirectly by the name of a 
scanner. 


It should also be noted that there is no par- 
ticular relation between the interpretation of 
a specific triplet as a test and the interpreta- 
tion of the same triplet as an operation. For 
example the triplet 

(B,A,6) 




74 PROCEEDINGS-^SPRING JOINT COMPUTER CONFERENCE, 1964 


interpreted as a test means ‘‘Is scanner B above 
y = 6T\ whereas as an operation it means 
“The number that scanner B is sitting on should 
have ANDed onto it the number 6 (i.e., its low 
order bit should be forced to zero).” Whether 
a triplet is to be interpreted as a test or an 
operation is determined by its position in the 
line of coding. 

The available operations permit scanners to 
be moved beyond the limits of their surfaces as 
defined by Figs. 2a and 2b; special considera- 
tion should be given to the results of such oper- 
ations. The surfaces of Fig. 2a act as helices, 
such that a single step “right” from the right 
column of the surface places the scanner on the 
leftmost column of that surface but one row 
below where it started. Conversely, a step left 
over the edge places it on the right edge one 
row above where it started. Motion above and 
below the top and bottom edges is legal, but the 
scanners will perform a “no-operation-and-con- 
tinue” instead of altering any “numbers” that 
they might be sitting on there. Furthermore, 
the programmer should be aware that the 
y-coordinate is treated modulo 2^^ so that after 
a step down from y = 0, a test on its position 
will result as if the scanner were at y = 32,767. 

The connectivity of the surfaces of Fig. 2b 
is similar to that of surfaces in Fig. 2a except 
that after stepping over the right edge of the 
surface, a scanner enters a no-man's-land a few 
squares wide; after successive steps through 
this region, the scanner appears, as in the case 
of the other surfaces, on the left edge, one row 
below the one it started on. 

Another precaution regarding the use of the 
operations of Table A.2 concerns the case in 
which two or more scanners are sitting on the 
same square, as they would be, for example, 
after an operation of the form 

(aXP) 

If one of the scanners changes the number on 
the square in any way, the other scanners on 
that same square do not “know” that the num- 
ber has been altered, and subsequent tests or 
operations involving the number under one of 
the other scanners may yield an erroneous re- 
sult. In general a scanner updates its memory 
of the number it is sitting on only when it 


moves to a square or when it itself changes this 
number. “Moving” a scanner a to its own posi- 
tion by the operation 

(a,T,a) 

will always properly update a’s conception of 
the number it is sitting on. 

Finally, a scanner's memory may be deliber- 
ately set to a number which has no relation to 
the number it is sitting on by the last operation 
of Table A.2 : 

(a,S,^) or (a,S,n) 

Subsequently, and until another operation is 
performed with scanner a, all tests and opera- 
tions involving a result as if a were sitting on 
n(^) or on n. This provides a useful way of 
passing information (numbers) to subroutines 
without actually having to write these numbers 
on a surface. 

Subroutines 

In order to perform a subroutine beginning 
at symbolic location sub, the special triplet 

(QQ,P,sub) 

is inserted as an ordinary triplet in a list of 
elementary operations. This special operation 
saves on a pushdown list the location at which 
the program was operating, and it transfers 
control to the subroutine. 

Exit from a subroutine is accomplished by 
use of the special goto which is identically 

QQ 

Since subroutine returns are recorded on a 
pushdown list, a subroutine may use itself, pro- 
vided that the programmer has in some way 
prevented indefinite recursion into the routine. 

Double-letter ''scanners'^ 

In addition to the 26 scanners A through Z, 
there is a special double-letter scanner sitting 
on the upper right hand corner of each surface. 
The name of each such special “scanner” is 
identical with the name of the surface. These 
scanners may not be moved but they may be 
used to write numbers on their particular 
squares. Their names may also be used as the 
“quantities” in tests and operations on regu- 
lar scanners. For example, there is a scanner 
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BB sitting on the upper right hand comer of 
surface BB at (251,183) and the instruction 

IF (A,X,BB)T{A,T,BB) 

would function the same as the instruction 

IF (A,X,251)T(A,Y,183) 

The double-letter scanners may also be used on 
the same basis as single-letter scanners for 
specifying areas to which the higher order oper- 
ations of Appendix B are to be applied. 

APPENDIX B. THE MOVIE LANGUAGE 
PROPER 

In addition to the scanner instructions of 
Appendix A, the movie programmer may use the 
more powerful instructions of the “movie lan- 
guage proper,” described below. These are, in 
general, macro-instructions which compile into 
calls to subroutines which themselves are writ- 
ten mostly in the scanner language. 

The movie instructions fall naturally into 
four categories, including instructions for 

(1) controlling output of pictures and tem- 
porarily storing pictures and retrieving 
them from the disc file, 

(2) performing drafting and typing opera- 
tions, 

(3) performing “instantaneous” operations 

on the contents of rectangular afea or 
surfaces, and 

(4) performing “dynamic” operations on the 
contents of rectangular areas or sur- 
faces. 

An instantaneous operation is one which is 
performed and completed between output of 
adjacent frames of film, whereas a dynamic 
operation is one which is performed gradually 
while several frames of pictures are being out- 
put by the “camera” output routine. 


These four groups of macro-instructions will 
be discussed in turn. The format of each in- 
struction will be illustrated and described in 
terms of dummy arguments and in most in- 
stances an example of the use of the instruc- 
tion will be given. A resume of all macro- 
instruction formats is given in Table B.l, which 
also contains a list of the more common dummy 
arguments used to describe these instructions. 

1. Instructions for Output and Temporary 
Storage 

An output routine or “camera” within the 
7090 is used to write information on the mag- 
netic tape which is later used to direct the 
S-C 4020 in exposing film. The camera routine 
is initiated by the instruction 

CAMERA n (n optional) 

(e.g. CAMERA 3) 

where n is the intended number of identical 
frames to be produced in the final film. Only 
one frame is produced by the 4020, with the 
number n printed just above this frame. If n 
is not specified in the CAMERA call, then the 
number used is that last specified by the setting 

FRAMES n 
(e.g. FRAMES 2) 

This setting is useful for controlling the appar- 
ent speed at which dynamic operations are per- 
formed, since the subroutines of the system 
which perform dynamic operations contain 
CAMERA calls without specification of n. In 
the event that the specified or effective n is 
zero, the camera call is ineffective, and no pic- 
ture is output. 

Besides the FRAMES setting, there are sev- 
eral other settings which control the operation 
of the camera routine. Camera settings, to be 
discussed in turn, include 


FRAMES 

AIM 

FINE or C0ARSE 
FILTER 
SAMPLE 
LINES 

FILM or N0FILM 


(how many identical frames in final movie?) 

(what surface area to output?) 

(what resolution?) 

(what transliteration during output?) 

(how often to monitor results on printer?) 

(which lines of picture to print when monitoring?) 
(film output, or just monitor output?) 
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TABLE B.l 

Resume of Movie Macro-instructions 


Key (see text for meanings of dummy arguments not listed in the key) 
scTR scanner sitting on Top /?ight corner of affected rectangular area 

scBL scanner sitting on Bottom Left corner of affected rectangular area. If 

scBL = ‘O' then (0,0) of scTR's surface is implied 

sc,scl,sc2, . . . scanner names 


surf 


name of a surface 


mode 


ns,nsl, I 
ns2, . . . / 

n,nl,n2, . . . 


stands for one of the following arguments designating the three different 
ways of changing numbers on the surface : 

WRITE (replace old number by new one) 

AND (bit-by-bit logically AND old and new numbers) 
j^R (bit-by-bit logically 0R old and new numbers) 

indicate numbers to be used or the names of scanners sitting on the num- 
bers to be used 

numbers 


width 


a number from 1 to 6 designating the width in squares of a line, arc, curve, 
or border to be drawn 


dir 


stands for one of the four directions : UP, RIGHT, DOWN, or LEFT 


orient 


amt 

fctr 


speed 


stands for one of the eight basic reorientations : 


ST 

standard orientation 

90R 

rotated 90"" right (clockwise) 

90L 

rotated 90° left 

180 

rotated 180° 

X 

reflected through x axis 

Y 

reflected through y axis 

YEX 

reflected through line y = x 

YEMX 

reflected through line y = -x 


the number of squares of shift, rotation, etc. 

an integer from 2 to 6 specifying the factor of magnification, reduction, 
“stretch" or “press" 

a number specifying the “speed" at which a line is drawn, i.e., the number 
of squares it advances between successive calls of CAMERA (if this num- 
ber is very large, the entire line appears instantaneously) 


1. Instructions for output and temporary storage 


CAMERA 

n 

(w optional) or CAMERA UNTIL, n 

FRAMES 

n 

(O^n) 

FILTER 

n 

(O^n^lO) 

TABLE 

n,n0,nl,n2,n3,n4,n5,n6,n7 

AIM 

SC 


SAMPLE 

n 


LINES 

n 

{n is 11 octal digits) 
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FILM 

N0FILM 

UNTIL n,goto 

RESET n 

COARSE 
FINE 

ST0RE surf, where (where is NEXT,PREV, a number, or missing) 

RETREV surf, where (where is NEXT,PREV, a number, or missing) 

2. Instructions for drafting and typing 

(The entire scanner language of Appendix A may be considered to be in this category) 
LINE scl,sc2,mode,ns,width,speed 

ARC scl,center,d,mode,ns,width,speed,tl,ql,t2,q2 (t2,q2 optional) 

TRACE s 3 Tiibl,length,sc,orient,mode,ns,width,speed 

TYPE symb2,sc,size,Hspace,Vspace,mode,ns 

symbl 0CT n,( specification of curve to be drawn by TRACE) 

symb2 BCI n, (specification of text to be typed by TYPE) 

3. Instructions for instantaneous operations on rectangular areas 

PAINT scTR,scBL,mode,ns 

BORDER scTR,scBL,width, mode,ns 

SHIFT scTR,scBL, dir, amount 

R0TATE scTR,scBL,dir,amount,n (n optional) 

EXPAND scTR,scBL,dir,repl,rep2 

SQUASH scTR,scBL,dir,del,kp 

C0PY scTR,scBL,mode,orient,sc3,sc4,n (n optional) 

CENTER scTR,scBL 

GR0W scTR,scBL,nsl,ns2,ns3,goto (goto optional) 

SM00TH scTR,scBL 

4. Instructions for dynamic operations on rectangular areas 

DIS0LV scTR,scBL,sc3,pat 

Z00MIN scTR,scBL,fctr 

REDUCE scTR,scBL,fctr 

STRECH scTR,scBL,dir,fctr 

PRESS scTR,scBL,dir,fctr 

These settings must be made before the first 
CAMERA call, but they may be changed at any 
later point in the program. 

The surface area to be output is determined 
by the position of that scanner sc specified in 
the last previous setting of the form 

AIM sc 

(e.g. AIM BB) 

(or AIM A) 

The top right corner of the picture which is 
output by a CAMERA call is the current posi- 
tion of that scanner specified by the AIM set- 


ting. If the scanner moves, the camera tracks 
the scanner. 

The output mode (coarse or fine) is deter- 
mined by which of the settings, 

FINE 

or C0ARSE 

occurred last. The setting FINE specifies that 
henceforth and until encountering the next 
COARSE, the area to be output is a rectangular 
array of squares 252 wide and 184 high. The 
setting C0ARSE specifies that henceforth and 
until the next FINE, the area to be output is 
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an array 126 wide and 92 high. In this case, 
the spots are displayed at twice the spacing for 
FINE output in order to fill the complete movie 
frame. 

It will normally be the case that the picture 
on film is to be composed of other charactron 
characters than the digits 0 through 7. This 
requires transliteration during output, specified 
by the setting 

FILTER n (0 n ^ 10) 
(e.g. FILTER 5) 

where n is either 0, specifying no translitera- 
tion, or a number from 1 to 10 specifying one 
of ten available transliteration tables. Each 
table thus used may be set up or later changed 
by a command 

TABLE n,n0,nl,n2,n3,n4,n5,n6,n7 
(e.g. TABLE 5,60,15,14,72,13,16,54,53) 

which causes table n to transliterate 0 into nO, 
1 into nl, 2 into n2, etc. The characters which 
may be used for output on the charactron are 
any of the sixty-four octal characters 0 through 
(77)8 except (12)8, (52) s and (56)8. The spe- 
cific example above gives a recommended grey 
scale, transliterating 0 into blank (60 )h, 1 into 
apostrophe (15)8, 2 into quote (14), 3 to degree 
sign (72)8, 4 to equal sign (13)8, 5 to delta 
(16)8, 6 to asterisk (54)8, and 7 to dollar sign 
(53)8 which is the darkest typeable character 
on the charactron. 

In addition to film output, printed output 
may be produced for monitoring purposes. The 
setting 

SAMPLE n 
(e.g. SAMPLE 24) 

says that printed output is to be produced for 
every n*’* frame of the final movie, except that 
this output will be produced at most once in 
any one CAMERA call. Thus if the sampling 
rate is 24 and the instruction, CAMERA 150, 
is encountered, only one printed output is pro- 
duced during this operation, labeled with the 
beginning and ending frame numbers of the 
corresponding sequence of identical frames in 
the final movie. The part of -the frame which 
is to appear in the printed output is predeter- 
mined by the bit pattern of the 11-digit octal 
number, n in the setting, 


LINES n 

(e.g. LINES 1400000003) 

The first octal digit is 1 or 0, stating that lines 
X = 91 and x = 90 of a COARSE picture should 
or should not be printed, each successive hit 
states whether or not the next S lines should be 
printed. In the example given, only the top 5 
lines and the bottom 6 lines would be printed. 
If the output mode is FINE, then only the odd 
columns of the corresponding odd-numbered 
rows are printed. If the above sample LINES 
setting were used for fine-resolution output, 
then odd-numbered positions of rows 183,181, 
179,177,175,11,9,7,5,3 and 1 would be printed 
(counting the bottom line as line zero) . 

The actual production of film is enabled or 
disabled by the instructions 

FILM 

or N0FILM 

each of which compiles into a single machine 
instruction. Common practice is to begin every 
program with this sequence of instructions, and 
to test the program in this form, producing only 
the printed output for monitoring purposes. 
When the program has been checked out, a pro- 
duction run is performed in which a correction 
card replaces the N^FILM instruction with a 
Nj^P machine instruction. 

The system contains a counter which counts 
frames of the final movie. This counter may 
be interrogated, and flow of control directed 
by the branch 

UNTIL n,goto 
(e.g. UNTIL 2400, AGAIN) 

which causes control to go to the indicated 
goto if the current frame count is below the 
specified number n, A special format for the 
CAMERA call also uses the frame counter 

CAMERA UNTIL,n 
(e.g. CAMERA UNTIL, 2496) 

This call directs the camera routine to produce 
one frame, as does a normal call, but in this case 
the number in the frame line — specifying how 
many times this frame is to be repeated in the 
final movie — is made just large enough to bring 
the frame count up to the specified n. The 
frame counter may be reset by the program to 
any n by the command 
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RESET n 
(e.g. RESET 4800) 

Operations and tests involving the frame 
counter are intended primarily to facilitate 
synchronization of the movie with a sound 
track which is added later by traditional 
methods. 

In addition to putting out pictures on film, 
contents of entire surfaces may be temporarily 
stored on and retrieved from the disc file. There 
are 440 available storage areas on the disc, 
numbered 1 through 440, each capable of stor- 
ing the entire contents of any surface except 
surfaces VV or CC. Storing of surface surf 
is accomplished by the instruction 

ST0RE surf, where (where is 

n,NEXT,PREV, or 
null) 

(e.g. ST0RE AA,150) 
and the picture is retrieved by the instruction 

RETREV surf,where (where is 
n,NEXT,PREV, or 
null) 

(e.g. RETREV AA,PREV) 

In either case, the storage area where may be 
specified explicitly by a number, or implicitly 
by three other possibilities : if where is missing 
in the call, the last area used in a ST0RE or 
RETREV command is used ; NEXT implies the 
next higher area than the last one used, and 
PREV implies the next lower area than the 
last one used. The surfaces VV or CC can be 
effectively stored by storing the contents of 
both AA and BB, since the latter occupy the 
same internal 7090 storage space. For example, 
contents of surfaces VV (or CC) may be stored 
in areas 150 and 151 by the sequence 

ST0RE AA,150 
ST0RE BB,NEXT 

and they may then be retrieved by the sequence 

RETREV BB 
RETREV AA,PREV 

One precaution must be taken after retrieval 
of a picture: a scanner a now sitting on this 


surface may not act as if it were on the corre- 
sponding new number until it is first “moved” 
by some such scanner instruction as (o,T,a) or 
(a,X,a) or (a,Y,a). 

2. Instructions for Drafting and Typing 

The instructions for drafting have the names 
LINE, ARC, and TRACE; the one instruction 
for typing is called TYPE. The^ are all dy- 
namic operations in the sense that while each 
is being executed it is interrupted periodically 
to allow the camera routine to output pictures. 
For the TYPE instruction, the interruption 
occurs after every large character (made up of 
a rectangular matrix of numbers) has been 
typed, and the camera routine may be rendered 
ineffective by a previous setting ‘FRAMES O’. 
For the drafting operations, the interruption 
occurs every time the line being drawn has ad- 
vanced another n squares, where n is specified 
in the instruction as the “speed” at which the 
line is to be drawn. In this case, the interrup- 
tion is avoided, or it is rendered ineffective, by 
a very high speed (e.g. 5000) or by a previous 
setting ‘FRAMES O’. 

An approximation to a straight line is drawn 
from scanner scl to sc2 by the instruction 

LINE scl,sc2,mode,ns,width, 
speed 

(e.g. LINE A,B,WRITE,5,3,9) 

% 

The line is drawn by using the number ns (or 
the number that ns is sitting on if ns is a scan- 
ner name) :* if mode is WRITE, this number is 
used to replace the numbers on affected squares, 
whereas if mode is 0R or AND, the new num- 
ber is 0Red or ANDed, bit by bit, with the 
previous number in each affected square. The 
line is drawn by an imaginary stylus which 
moves by stepping either horizontally or ver- 
tically to the adjacent square which lies closest 
to the ideally straight line. On each square 
thus traversed, an approximately circular dot 
of diameter ividth is centered, and a picture is 
produced after advancing each n squares along 
the line, where n is the desired speed. 

An arc is drawn in a manner similar to the 
drawing of a line, by the instruction 


optional 

ARC scl,sccent,d,mode,ns,width,speed,tl,ql,t2,q2 
(e.g. ARC A,B,CCW,0R, 6,5,12, Y,B,R,B) 
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where the arguments mode, ns, width, and speed 
have the same meaning as for LINE. The arc 
begins at the position of scanner scl, and pro- 
ceeds with scanner sccent as its center, in direc- 
tion d, which is CW for clockwise or CCW for 
counterclockwise. The arc is terminated when 
the drawing stylus, which itself may be thought 
of as a scanner, satisfies the joint condition, 
expressed in scanner language, 

{stylus, tl ,ql) {sty\us,t2,q2) 

or when it satisfies simply the first condition if 
the second condition is not given in the ARC 


instruction. If the terminating condition is 
not satisfied within 1000 elementary steps, the 
program stops. In the specific example above, 
an arc of width 5 is initiated at scanner A and 
is drawn about scannerB as center in a coun- 
terclockwise direction at speed of 12, by 0Ring 
the number 6 onto affected squares. The arc 
is terminated when the drawing stylus is at the 
same height as scanner B and right of scan- 
ner B. 

A curve of arbitrary shape may be traced by 
an instruction of the form 


TRACE 
re.g. TRACE 

1 CURVE? 0CT 


Symb,length,sc,orient, mode, ns, width, speed 
CURVE7,19,A,ST,WRITE,2,1,10 ^ 

000011122233,455667000000 i 


where mode, ns, width and speed have the same 
meanings as for LINE. The argument length 
is the length of the curve to be drawn, ex- 
pressed in elementary steps from one square 
to the next, sc is the scanner at which the 
curve is to start, and orient is one of the eight 
possible reorientations of the basic curve (see 
the Key in Table B.l) . Symb is a FAP symbol 
indicating the location at which a description 
of the basic curve is given in terms of a 
sequence of incremental steps. Each step is 
here specified by one of eight octal digits which 
stand for the eight possible directions for 
these steps: 



V 

( 

y 

) 




6< — 

> 

vl 

\y' 

|\ 

—^2 



5 

> 

1 


/ 

CO 



In the specific example above, if the original 
surface contained only zeros, and if on output 
zeros are transliterated to blanks and 2’s to 
dots than the result would be 


position of 

scanner A ► . 

If the same curve (with the same description) 
had been drawn in the YEMX orientation (re- 
flected through the line y = — x) it would have 
started with a straight section going left and 
then would have spiralled counterclockwise. 

The operation of “typing” in the movie lan- 
guage is done by affecting appropriate patterns 
of squares. The general form of the typing 
instruction is 


TYPE symb,sc,size,Hspace,Vspace,mode,ns 
re.g. TYPE TEXT5,B,5x7,l,3,WRITE,6 1 


TEXT5 BCI 
BCI 
BCI 
BCI 


3,*( BELL 

3,*/ *TELEPH0NE 
3,*/ ’LABORATORIES, 
3,*/ *INC. * 
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where symb is a FAP symbol identifying the 
description of the text to be typed, se is the 
scanner specifying the position of the bottom 
left hand comer of the first character to be 
type, and size is one of the following sizes of 
characters which may be typed : 

4x5 (4 squares wide, 5 high) 

5x7 (5 squares wide, 7 high) 

7x10 (7 squares wide, 10 high) 

8x11 (8 squares wide, 11 high) 

10x14 (10 squares wide, 14 high) 

15x21 (15 squares wide, 21 high) 

The argument Hspace specifies the horizontal 
spacing in squares between the characters of 
a line, whereas Vspace specifies the number of 


squares of vertical spacing between successive 
lines. The mode may be AND, 0R, or WRITE 
as with the drafting operations, but the pro- 
grammer must note that only those squares 
corresponding to the body of the letter are 
affected : background areas, such as the center 
of a ‘0’ or the entire area of a blank are un- 
changed regardless of the mode used. The argu- 
ment ns is either the number to be used in 
changing affected squares, or the name of 
scanner sitting on the number to be used. The 
description of text to be typed is written out 
on FAP BCI cards. All of the 48 standard IBM 
Hollerith characters may be typed. The follow- 
ing special sequences, all starting with **’, have 
the indicated meanings for the typing routine: 


*. stop typing 

* ( shift to upper case (affects typing of letters only) 

* ) shift to lower case ( affects typing of letters only ) 

*L where L is any letter: Type this letter in upper case, then shift to lower 

case 

* return carriage : Return to starting point of previous line, then go down 

letter height plus vertical spacing 
** type the character 


In the specific case of the example illustrated, 
the result would appear approximately as 
follows : 

BELL” 1 

Telephone | 

Laboratories, 37 squares 
Inc. jf 

-4-77 squares.>- 

Other examples of typing operations appear in 
Appendix C. 

3. Instructions for Instantaneous Operations 
on Rectangular ArecLS 

The contents of rectangular areas may be 
altered by any of a large number of instan- 
taneous operations — operations which appear 
to be performed instantaneously in the movie 
because their respective subroutines contain 
no CAMERA calls. The. formats and uses of 
these instructions are described in the following 
paragraphs. In all cases, the rectangular areas 
to be changed are specified by the positions 
of scanners. The dummy argument “scTR” in 
a format statement stands for “scanner defin- 
ing the Top /?ight corner of the area.” The 


dummy argument “scBL” means “either the 
name of a scanner defining the Bottom Left 
corner of the area or else ‘O', meaning x = 0, 
y = 0 of the surface that scanner scTR is on.” 

A rectangular area may be “painted” — every 
square changed by using the same number — 
by the instruction 

PAINT scTR,scBL,mode,ns 
(e.g. PAINT ZZ,0,WRITE,0) 

where mode, as before, is AND, 0R or WRITE, 
indicating whether the change is to be accom- 
plished by ANDing, 0Ring, or replacement, re- 
spectively and ns is either the number to be 
used or the name of a scanner sitting on the 
number to be used. In the example above, the 
entire surface ZZ is “cleared” to zeros. 

A rectangular border of any thickness may 
be produced just within the periphery of the 
rectangular area by the instruction 

B0RDER scTR,scBL,width,mode,ns 
(e.g. B0RDER A,B,9,0R,A) 
where width is the thickness of the desired 
border, mode and ns indicate the manner in 
which numbers are to be changed and the num- 
ber to be used, as in the case of PAINT. 
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The contents of a rectangular area may be 
shifted up, right, down or left any number of 
positions — up to the dimension of the area in 
this direction — ^by the instruction 

SHIFT scTR,scBL,dir,amt 
(e.g. SHIFT AA,0,UP,36) 

where amt is the number of squares of shift 
and dir is the direction of shift, UP,RIGHT, 
D0WN, or LEFT. As a result of the shift 
operation, material is lost at one edge of the 
rectangular area and the “vacated” area is 
filled by repeating the original row or column 
just within the edge that contents of the area 
were shifted away from. 

The material within a rectangular area may 
be “rotated,” as if around a cylinder, by the in- 
struction 

R0TATE scTR,scBL,dir,amt,n ( n optional ) 
(e.g.R0TATE C,D,LEFT,5) 

The effect is the same as for SHIFT except that 
the material which is lost at one edge is intro- 
duced at the opposite edge. Furthermore, if n 
is given, the material which “goes around the 


back” to be reintroduced is transliterated accor- 
ing to transliteration table n. This table must 
have been previously established by a TABLE 
instruction such as is used to specify transliter- 
ation during output, but only the three lowest- 
order bits of the new numbers can be used for 
the R0TATE transliteration. 

The pattern of numbers within a rectangular 
area may be “expanded” upward or downward 
by duplicating certain rows, or it may be ex- 
panded to the right or left by duplicating cer- 
tain columns. The instruction 

EXPAND scTR,scBL,dir,repl,rep2 
(e.g. EXPAND F,G,RIGHT,2,3) 

causes the material to be expanded in the direc- 
tion dir (UP, RIGHT, D0WN, or LEFT) by 
starting at the edge row or edge column oppo- 
site the direction of expansion and repeating 
the first row or column repl times, the next 
row or column rep2 times, the next repl times, 
etc., until the entire rectangular area has been 
refilled. The sample EXPAND instruction above 
would change the pattern 


“(position of scanner Fy 
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3 

0 
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6 

5 

4 

3 

2 

1 
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0 

0 
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7 

7 

6 
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5 

T 
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3 

4 

0 

7 

6 
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-S. 









-(position of scanner G)- 


The pattern of numbers in a rectangular area 
may also be "‘squashed^’ toward one edge by de- 
pleting certain rows or columns and by repack- 
ing the remaining numbers. The instruction 

SQUASH scTR,scBL,dir,del,kp 
(e.g. SQUASH A,B,LEFT,1,2) 

performs such an operation by starting at the 


edge specified by the direction dir and alter- 
nately deleting del columns (rows) and keeping 
kp columns (rows). The remaining columns 
(rows) are closely packed and the vacated area 
is filled by duplicating the original column 
(row) just vrithin the edge from which the mo- 
tion has occurred. The sample instruction above 
would change the pattern 


“(position of scanner A) 
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7 


-(position of scanner B)- 
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A rectangular area may be filled or changed 
by “copying” from another area. This is ac- 
complished by the powerful and versatile in- 
struction 

C^PY scTR,scBL, mode, orient, sc3,sc4,n 
(n optional) 

(e.g. C(^PY A,B,WRITE,90L,C,D) 

Here, as before, scTR and scBL define the af- 
fected area, seS is a scanner on this same sur- 
face (it may in fact be either scTR or scBL) 
and sci is a scanner on the surface from which 
material is being copied. The precise area to be 
copied is visualized as follows : the entire sur- 
face being copied from is reoriented according 
to orient (see Key in Table B.l) and then supe- 
imposed on the surface to be changed in such a 
position that sc3 and sc4 coincide. It is that 
area which now falls on the rectangle defined 
by scTR and scBL which is used : this area is 
first transliterated according to table n if n is 
given, and then it is ANDed, 0Red or written 
into the rectangle, accordingly as mode is AND 
or 0R or WRITE. Peculiar and unexpected 


patterns may result if the two rectangular areas 
involved are -overlapping areas on the same sur- 
face, unless they are exactly the same area and 
orient is ST. The above sample instruction in- 
volves a 90-degree rotation to the left (counter- 
clockwise). Thus, if scanner D is located as 
shown : 


5 6 7 3 2 1 7 

0 ir2 3 4 ] 5 6 

I ' 

0 O' 1 1 2 1 2 3 

I 

4 4 1 5 5 6 ‘ 6 7 /position of scanner D) 


7 6] 5 4 3^ 


1 6'J_2_5J4 0 
4 6 4 7 3 0 2 


the effect would be to change the rectangular 
area 



Lines of typed text, such as lines of a movie 
title, may be centered in a rectangular area by 
the instruction 

CENTER scTR,scBL 
(e.g. CENTER A,0) 

For this instruction to be effective, all squares 
bordering on the edge of this area must contain 
the same numbe, called the “backgound” num- 
ber. The complete background consists of all 
rows containing only the background number 


and, for each horizontal stripe not thus in- 
cluded, it also contains the widest possible rec- 
tangle on the right and on the left which con- 
tain only background numbers. Remaining sub- 
areas thus delimited are called “lines of text.” 
The CENTER operation identifies all lines of 
text, centers each such line horizontally, and 
moves all lines together by a shift-up or shift- 
down to make top and bottom background 
stripes equal. Thus it would change the area 
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Two different local operations can be per- 
formed throughout a rectangular area. Both 
consume a relatively large amount of computer 
time (ca. 10 seconds for the surface A A) ; 
therefore the area to which they are applied 
should be judiciously limited. The first local 
operation, 

GR0W scTR,scBL,nsl,ns2,ns3,goto 
(go to optional) 

(e.g. GR0W XX,0,2,3,5,AGAIN3) 

changes every number nsl which is next to an 
ns^ into an nsS. The number ns^ must also be 
within the area and it must in each instance be 
immediately above, below, right of or left of the 
number to be changed. The numbers nsl, ns2 
and nsS must be three different numbers, and 
each may be either given explicitly or specified 
as the name of the scanner sitting on the num- 
ber to be used. If the instruction contains a 
goto, then on completion of the operation con- 
trol passes to the goto if at least one such num- 
ber was changed. Two GR0W instructions are 
commonly used in a. loop which causes the re- 
gion defined by two numbers gradually to 
"‘grow’’ into the region defined by a third by 
alternately adding incremental layers first of 
one of its numbers and then of the other. If the 
loop contains a CAMERA call, the growth proc- 
ess is seen in the movie. 

The other local operation, which has the 
effect of removing sharp corners, is 

SM00TH scTR,scBL 

(e.g. SM00THA,B) 


The operation proceeds by identifying every 
instance of the four patterns 


n 

n 



n 
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P 

n 


n 
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n 


n 






n 


P 

n 

n 

n 



where all four n’s in a pattern must be the same 
number. For each instance thus found, the p 
is changed to an n. 


4. Instructions for Dynamic Operations on 
Rectangular Areas 

Dynamic operations on rectangular areas are 
those for which a series of pictures are output, 
according to current camera settings, while the 
operation is proceeding. As in the case of in- 
stantaneous operations on rectangular areas, 
the affected area must be designated by two 
scanners (or by one scanner and ‘O', meaning 
0,0 of that scanner's surface) . 


A rectangular area may have another area of 
similar size and shape “dissolved" onto it while 
a series of 36 pictures is being produced. The 
instruction is 

DIS0LV scTR,scBL,sc3,pat 
(e.g. DIS0LV XX,0,ZZ,F) 

where scanner scS indicates the top right cor- 
ner of the area of similar size and shape which 
is to be dissolved onto the area specified by 
scTR and scBL. The dissolve is accomplished 
by dividing the area, which must be at least 6x6 
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squares in sizes, into 6x6 subareas — possibly 
with oddly shaped rectangles at the left and 
bottom. Between any two successive pictures 
of the dissolve, an additional number of the old 
picture is replaced by the corresponding num- 
ber from the new picture in each 6x6 subarea. 
The order in which new numbers are thus in- 
troduced is specified by the parameter pat which 
must be one of the following : 

F fade : quasi-random order 

SI spiral in (simultaneously in all 6x6 sub- 
areas) 

S0 spiral out (simultaneously in all 6x6 sub- 
areas) 

R wipe to right (simultaneously in all six- 
column-wide sections) 

L wipe to left (simultaneously in all six- 
column-wide sections) 

“A “zoom” effect may be approximated by 
gradually magnifying horizontally and verti- 
cally, by an integral factor, the pattern of num- 
bers within a rectangular area. This is accom- 
plished by the instruction 

Z00MIN scRT,scBL,fctr 
(e.g. Z00MIN YY,0,3) 

where fctr is the magnification factor, an in- 
teger from 2 to 6. During the Z00MIN opera- 
tion, the approximate center of the pattern is 
fixed and material is lost off all four edges. 

An approximate inverse of Z00MIN is ac- 
complished by the instruction 

REDUCE scTR,scBL,fctr 
(e.g. REDUCE A,B,5) 

where fctr is the factor of reduction, again an 
integer from 2 to 6. Reduction is accomplished 
by repeated deletions of rows and columns and 
by repacking toward the center those remain- 
ing. Vacated area around the periphery is 
filled in by repetitions of rows and columns 
originally just within the periphery of the area. 

A unidirectional magnification is accom- 
plished by the instruction 

STRECH scTR,scBL,dir,fctr 
(e.g. STRECH A,B,UP,2) 

which holds one edge of the pattern fixed and 
“stretches” the pattern — ^by duplicating rows 
or columns — in the indicated direction until 
( 1/fact) th of the original pattern covers the 


entire area, with each of these rows (columns) 
repeated fctr times.* The direction dir must be 
UP, RIGHT, D0WN, or LEFT, and the factor 
fctr must be an integer from 2 to 6. 

Finally, an approximate inverse of STRECH 
is provided by the instruction 

PRESS scTR,scBL,dir,fact 
(e.g. PRESS ZZ,O,D0WN,3) 

which causes the contents of the area to be com- 
pressed against one side of the area — hy dele- 
tion of rows or columns and repacking in the 
direction dir. The vacated area is filled with 
repetitions of the row or column just within the 
edge from which motion occurs. As before, dir 
must be UP, RIGHT, D0WN, or LEFT, and 
fctr is the factor of compression, and integer 
from 2 to 6. 

APPENDIX C 

An Example of Movie Language Coding 

The actual program which produced Fig. 2a 
is here given as an example of movie language 
coding. It consists of five parts : definitions of 
new macro-instructions, coding for composing 
the picture, coding for outputting the picture, 
closed subroutines, and descriptions of text to 
be typed. Particular attention should be paid 
to the first section, which illustrates how the 
programmer devices his own macro-instruc- 
tions on a still higher level, instructions which 
are either more powerful or which are more 
specifically suited to a particular task. 

A new macro-instruction of wide application, 
BENTLN, is used for drawing any line which 
consists of a series of straight line segments, 
such as the two braces, of Fig. 2a. BENTLN is 
defined in terms of SEGMNT, which compiles 
into a call for the subroutines SEG . . . that 
actually draws one segment of the line. The 
new macro-instruction, SURF, on the other 
hand, was designed specifically for Figs. 2a and 
2b. It permits convenient description of the 
position and size of a surface in the drawing 
and of labels indicating its name and its size in 
squares. 

The program is here listed, with explanatory 
comments on the right. (For more complete 
description of the MACR0 FAP compiler see 
the Bell Telephone Laboratories, 7090 Program- 
mer’s Manual, July 15, 1963) : 
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BENTLN 


* 

SEGMNT 


* 

SURF 


* 


TP(j)R 


MACRjZJ 

THEN 

IRP 

SEGMNT 

IRP 

ENDM 

MACR0 

IFF 

THEN 

THEN 

ENDM 

MACR0 

PLACE 

THEN 

PAINT 

THEN 

TYPE 

CENTER 

IFF 

THEN 

TYPE 

ENDM 


FILM 

FRAMES 

PAINT 

BENTLN 

BENTLN 

PLACE 

TYPE 

PLACE 

TYPE 

PLACE 

TYPE 

IF 

SURF 

SURF 

SURF 

SURF 

B0RDER 

AIM 

FINE 

FILTER 

TABLE 

FRAMES 


XSTART,YSTART, PARTS (STARTING POINT + LIST OF SEG- 
MENTS) 

(C,T,BB) (C,X,XSTART) (C,Y,YSTART) (D,T,C) 

PARTS )C0MPILE SEGMNT FOR EACH PART. EACH “PART” 
PARTS ) IS A LIST 0F A DIRECTION (U,R,D,0R L) AND AN 
AM0UNT PLUS AN 0PTIONAL 2ND DIRECTI0N AND 
AM0UNT. END 0F BENTLN DEF. 

DIR1,AMT1,DIR2,AMT2 (DIR2 AND AMT2 0PTI0NAL) 

0,/CRS/DIR2 IF DIR2 PRESENT THEN 

(D,DIR2,AMT2) C0MPILE THIS LINE 

(D,DIR1,AMT1) (QQ,P,SEG . . .) C0MPILE CALL T0 SEG ... 

END 0F SEGMENT DEFINITI0N 

XI, Yl, WIDTH, HEIGHT, NAME, SIZELN, SHIFT (SHIFT 0PTI0NAL) 
B,BB,X1,Y1 

(A,T,B) (A,R,WIDTH) (A,L,1) (A,U,HEIGHT) (A,D,1) 

A,B,WRITE,2 PAINT RECTANGULAR AREA 

WITH 2’S 

(C,T,B,) (C,U,1) (C,R,1) (D,T,B) (D,D,8) 


NAME,C,8X11,1,1,WRITE,0 

A,B 

0,/CRS/SHIFT 

(D,R,SHIFT) 

SIZELN,D,5X7,1,1,WRITE,7 


BB,(^,WRITE,0 


TYPE NAME WITH O’s 
CENTER IT 

SHIFT SIZE LABEL RIGHT 
TYPE SIZE LABEL 
END 0F SURF DEFINITION 
END 0F DEFINITIONS OF NEW 
MACR0S 

BEGINNING 0F PR0GRAM 
(N0 0UTPUT DESIRED DURING 
TYPING) 

CLEAR SURFACE BB T0 O’S 

46.173 ((L,3,D,2) (D,18) (L,3,D,2I (R,3,D,2) (D,17) (R,3,D,2)) 

95.173 aR,3,D,2) (D,46) (r,3,D,2) (L,3,D,2) (D,40) (L,3,D,2) ) 

C,BB,8,167 (TW0 BRACES NOW D0NE) 

0RSUR,C,5X7,1,2,WRITE,7 TYPE “0R SURFACE . . .” N0TE 
C,BB,10,60 

FIG2A,C,5X7,1,3,WRITE,7 
C,BB,103,125 

TEXTOR,C,5X7,l,2,WRITE,7 
(C,L,120) T (C,X,152) TP0R 

52.140.40.28, TEXTAA, 252X18 

52.90.40.28, TEXTBB,252X18 
1 10,100,40,56,TEXTVV,252X36 

160.110.80.28, TEXTCC,504X18,27 
BB,0,2,WRITE,7 

BEGIN CAMERA SETTINGS F0R 0UTPUT 
BB AIM CAMERA AT SURFACE BB 

0UTPUT ENTIRE SURFACE BB (252X184) 

5 ) TRANSLITERATE O’S TO BLANKS (60)8 

5,60,1,72,3,4,5,60,54 ) AND 7’S TO ASTERISKS (5408) 

150 DA 1 FRAME, WITH “150” ON FRAME LINE 


TYPE “FIG. 2A 


CAPTI0N 


TYPE “0R” 

MOVE RIGHT, TYPE “0R” AGAIN 
D0 THE 4 SURFACES 
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* 

SAMPLE 

LINES 

CAMERA 

TRA 

SEG . . . 

LINE 

0RSUR 

THEN 

BCI 

TEXTAA 

BCI 

BCI 

BCI 

BCI 

BCI 

BCI 

TEXTBB 

BCI 

TEXTCC 

BCI 

TEXTVV 

BCI 

252x18 

BCI 

252X36 

BCI 

504X18 

BCI 

TEXTOR 

BCI 

FIG2A 

BCI 


BCI 

BCI 

BCI 

BCI 


1 

nmmm 

FINISH 

C,D, WRITE,?, 1,5000 

(C,T,D)QQ 

2,*( 0R 

2,*/SUR- 

2,VFACES 

2,*/ (Z}f 

2,VFIG. 

2,V 2*)B.*. 

1,AA*. 

1,CC*. 

1,VV*. 


) PR0DUCE 0N PRINTER ALL 0DD 
) C0LUMNS OF ALL 0DD LINES 

PR0DUCE PICTURE 

BEGINNING OF CL0SED SUBROUTINES 
) SUBROUTINE F0R DRAWING A 
) STRAIGHT SEGMENT 0F A LINE 
(DESCRIPTIONS OF TEXTS ) 

( TO BE TYPED BEGIN HERE ) 

( AND C0NTINUED T0 THE END. ) 


2,252*) X 184*. 

2,252* )X368*. 

2,504*) X184*. 

1,0*/R*. 

8,* (FIG. *)2A. *NAMES AND SIZES OF SURFACES 
8,*/LARGE ENOUGH T0 H0LD C0MPLETE FINE- 
8,*/RES0LUTiOn mOvIE FRAMES (252 X 184). 
8,*/*ALTERNATE USES 0F T0TAL 7090 ST0RAGE 
8,*/AREA ARE INDICATED, INCLUDING THE USES 
8,*/0F REGI0N *A*A IN WAYS SH0WN IN *FIG. 2B.* 




SIMULATION OF BIOLOGICAL CELLS BY SYSTEMS 
COMPOSED OF STRING-PROCESSING FINITE AUTOMATA’ 

Walter R, Stahly\ Robert W. Coffin, t and Harry E. Goheenl 


INTRODUCTION 

In the last few years enormous progress has 
been made in clarifying the operational mech- 
anisms of living cells. It has been established 
beyond reasonable doubt that all aspects of cell 
activity are controlled by sequences of elemen- 
tary genetic units. A comma-free triplet coding 
in the four-letter alphabet of DNA is tran- 
scribed on RNA and causes the formation of 
sequences of amino acids, which make up poly- 
peptides and proteins. Various theories of 
transcription control for such system.s are now 
under study. Recently, synthetic nucleic acid 
(RNA) chains have been fed into the cell 
machinery, thus demonstrating that protein 
synthesis can be controlled artificially. Numer- 
ous finer details of the problem could be men- 
tioned (see Crick,® Nirenberg,^^ Rich,^® Wad- 
dington,^'^ and AnfinsenU but shall not be con- 
sidered in this report. 

There arises the question of what type of 
mathematical modelling method is best suited 
for simulation of molecular genetics. In the 
past numerical models, based on chemical ki- 
netics expressed in terms of simultaneous dif- 
ferential equations, have usually been applied. 
Impressive results were obtained by Chance et 
al.,'^ Garfinkel,^^ Hommes and Zillikeni® and 
others. However, it has also become clear that 


simultaneous solution of hundreds or thousands 
of diiferential equations, many of whose coeffi- 
cients probably cannot be measured experimen- 
tally (see Pardee'^2)^ poses a difficult problem. 
Actual cells may contain thousands of different 
genes and hundreds of thousands of synthetic 
units. Major questions of solvability and stabil- 
ity of such systems must be faced in an attempt 
to model a complete cell. 

The present report describes a fundamentally 
different approach to the problem of simulat- 
ing a cell, some aspects of which were reported 
earlier in Stahl and Goheen.^^ Since genes and 
proteins are representable as linear chains or 
strings, it is proposed that molecular mecha- 
nisms of cells be simulated by string-processing 
finite automata. In this model strings repre- 
senting DNA, RNA, proteins and general bio-: 
chemicals are subjected to controlled copying, 
synthesis into longer strings and breakdown in- 
to shorter ones, with use of what may be called 
“algorithmic enzymes.'' A major property of 
these logical operators is that they are combin- 
able into large systems with complex properties. 

The materials below deal in turn with a new 
computer simulation system for studying finite 
automata, the properties of algorithmic en- 
zymes, experimental studies with systems of the 
latter and lastly with some questions of solv- 
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ability for model cells defined entirely by au- 
tomation-like enzyme operators. 

THE AUTOMATON SIMULATION SYSTEM 

The quintuplet command code proposed by 
Turing is used in the programming system and 
well described in Turing,®* Davis'® and else- 
where. Turing’s device was designed for proofs 
of computability and in principle requires an 
infinite tape and infinite number of recursive 
steps for such demonstrations. This circum- 
stance does not prevent one from using the 
quintuplet code for general purpose simulation 
of automata. The late John von Neumann®® ®® 
pointed out that the Turing Machine represents 
a means of programming or simulating any 
algorithm, as well as for computability demon- 
strations. McNaughton^® has emphasized that 
the Turing Machine should be considered as the 
most general of automata. 

A compiler based on the Turing quintuplet 
notation (but not really modelling the Turing 
Machine) has been designed and is described in 
Coffin, Goheen and Stahl.® Simulation of a con- 
siderable number of different automata on the 
system, including ones for pattern recognition, 
has revealed tlat a computer program for 
modelling of automata is a useful research tool, 
which may find practical applications when it 
is desired to use a “variable programmed au- 
tomaton.” Results substantiating this conclu- 
sion are reported in Stahl, Coffin and Goheen*'* 
and Stahl, Waters and Coffin.*® A special com- 
piler, constituting an “automaton simulation 
program,” was written for a SDS-920 com- 
puter and has processing rates of up to 10,000 
quintuplet commands per second. Rates of over 
one million automation commands a second 
should be possible with presently known tech- 
nology. A number of special provisions have 
been included for automatic sequencing of dif- 
ferent circumscribed algorithms or automata 
presented as lists of quintuplets, debugging, 
selective printout of string during simulation 
runs, and so forth. 

A Turing compiler should not be evaluated on 
the basis of the inefficient operation of most 
Turing Machines described in the literature to 
date. The authors are using individual Turing 
Programs (algorithms) exceeding 1700 quin- 
tuplets in size and a complete system (namely. 


the algorithmic cell), which includes over 
43,000 quintuplets. Interesting results have 
been obtained for the problem of recognition of 
hand-printed letters (A-Z) and shall be re- 
ported elsewhere (Stahl, Waters and Coffin*®). 
Naturally, automaton simulation has a special 
range of application, as do research compilers 
such as LISP, IPL-V, or COMIT. 

The Turing Machine is a device which oper- 
ates on individual symbols presented on a single 
long tape, along which a reading head moves 
left or right, one cell square at a time. A capa- 
bility is provided for erasing and writing indi- 
vidual symbols and for recording the “state” 
of the Turing Machine, which defines uniquely, 
its response to a particular viewed symbol. 

Only one type of program command is used 
and consists of a quintuplet (or matrix table 
with output triplets of sjrmbols), which usually 
appears as follows: symbol scanned, state of 
machine, new state, motion (right — R, left — L 
and stay in place — P) and symbol to replace 
existing symbol before motion is carried out. 
A quintuplet such as 12 A :15RB is read “if in 
state 12 and A is viewed, then replace A by B, 
go to state 15 and move right one square.” A 
final logical halt of the automaton takes place 
on such an entry as 26* :26P*, which is read “if 
asterisk is seen in state 26, remain in place, 
stay in state 26 and do not alter asterisk.” 

In principle, the Turing Machine must have 
available an infinite tape and amount of time, 
but precisely the same notation can be used with 
finite automata and this has been done by such 
authors as McNaughton,®® Myhill,®* Trakhten- 
brot*® and others. The quintuplet command 
structure need not in itself connote the extended 
and often inefficient “shuffling” operation of 
the classical Turing Machine. 

ALGORITHMIC ENZYMES 

The concept of a Turing quintuplet code may 
be illustrated with a simple but biologically pro- 
vocative example in which a finite automaton 
simulates a Jyific eiKyme fhat 'hrealfcB down 
strinj^. Table I is a quintuplet program for 
an “automaton enzyme” wffiich lyses strings in 
the alphabet (AGCT), representing the four 
nucleic acid bases adenine, guanine, cytosine 
and thymine. A typical input tape into the 
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automaton using this code might be 
. . . <^=A-C-G-C-C-T-T~A-G-C~A=<^ . 

( 1 ) 

in which ‘V"' — empty cells, — start of 

string, — ^bond between letters. 


Table I 

TURING PROGRAM FOR A SIMPLE 
LYTIC ENZYME 


1 <j> !lR<^ 

2 T:2RT 

1 =:2R= 

2 /:2R/ 

2 =:2P=r 

3 =:3P= 

2 A:2RA 

3 -:2R/ 

2 C:2RC 

3 /:2R/ 

2 G:3RG 


2 -:2R- 



Following a single left to right pass the string 
in (1) will be converted into 

=:A-C-G/C-C-T-T-A-G/C-A= (2) 


in which every bond immediately to the right of 
a G, regardless of what symbol is next to it on 
the left or right, is converted into an ''open 
bond’* (/). 

Operation commences at the left end of the 
string.. The empty symbols (<^) are passed over 
by entry 1 (^:1R<^. When the left end-marker 
(=) is seen control passes to state 2. In state 

2 all symbols except G (namely, A,C,T and -) 
are simply skipped over, as in 2 - :2R-. If G is 
seen, by entry 2 G :3RG, control passes to state 
3, which next encounters a "bond” and con- 
verts it to an "open bond” using entry 3 -:2R/. 
Provisions are also made for skipping over any 
existing open bonds, as in entry 2 / :2R/. Stop- 
ping occurs in state 2 or 3, on an entry such as 

3 = :3P=. 

Table II is a sample of coding for a string 
synthesizing finite automaton, which was de- 
scribed in Stahl and Goheen^^ and is the proto- 
type for the algorithmic enzymes noted in this 
paper. The cited work also includes automata 


for copying and complementary copying of 
strings (as in DNA transcription), and for more 
complex types of lysis. 

In general, the construction of quintuplet 
programs for automata is straightforward. It 
is noteworthy that they are truly interchange- 
able because of the very standard format. It is 
clear, however, that string-processing enzymes 
might also be represented by other types of 
automata, such as Wang’s^® modified Turing 
Machine and that it would be entirely feasible 
to design special compilers that accept a "state- 
symbol” table. 

While the lytic enzyme of Table I was given 
principally as an example, it is interesting to 
note that reconstruction of a parent protein 
string following the action of several lytic en- 
zymes is an important problem today for nucleic 
acid and protein analysis. Rice and Bock^^ have 
pointed out that application of three specific 
lytic enzymes, which split "next to” only three 
of the four specific bases in DNA, does not allow 
a unique reconstruction of the parent chain. 
This is an excellent example of algorithmic un- 
solvability arising in a biological context, and 
moreover even in very classical form, namely. 
solution of a "word problem” by algorithmic 
methods. 

It must be emphasized that the lytic enzyme 
of Table I in no way models the physiochemical 
properties of any real enzyme that might per- 
form the indicated lysis, and only simulates the 
string-processing aspects of the enzyme action. 
This type of model is somew’hat comparable to 
the McCulloch-Pitts^s imitation of neurons by 
the threshold Boolean "logical neuron,” in that 
both model systems are rather gross from the 
biological viewpoint, but involve a consistent 
mathematical methodology. The McCulloch- 
Pitts neuron can be combined into large sys- 
tems, such as perceptrons, and much the same 
step has been taken with algorithmic enzymes. 
The main problem in biological modelling is 
probably that of finding well-defined mathe- 
matical methods which can be applied with 
profit to the biological system. 
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Table II 

TURING PROGRAM FOR CONDITIONAL STRING SYNTHESIS 


1=:2R= 

6= :7L= 

10c:12Rc 

15= :16L= 

19c:19Lc 

1(^ rlR<^ 

6*:7L8 


15* :16L* 

19e:19Le 

la :llLx 

6a :8Ra 

11=:13R= 

15a:17Ra 

19x :19Lx 


6b:8Rb 


15b;17Rb 


2= :2P= 

6c :8Rc 


15c:17Rc 

20* :20R* 

2* :2R‘' 


lla:llLa 


20<i> :20R<^ 

2<^ !2R(^ 

7= :9R= 

llbrllLb 


20a :20Ra 

2a :2Ra 

7* :7L* 

llc:llLc 

16= :20R= 

20b :20Rb 

2b :2Rb 

Icf) lHucj) 

lle:llLe 

16<^ :19L* 

20c :20Rc 

2c :2Rc 

7a :7La 


16a :16L<^ 

20e:21r<^ 

2e :3Le 

7b :7Lb 

12= :12P= 

16c:16L<#> 

20x :20R<f> 


7c :7Lc 

12* :9R* 



II 

II 

CO 

7e:7Le 

1q(^ !l2R<^ 

17= :18L= 

21=:22R* 

3* :3L^ 


12a:12Ra 

17*:13R* 

21*:21R* 

S<f> :3L<^ 

8= :8P- 

12b:12Rb 

17^ *1TR<^ 

21^ i21R<^ 

3a :3La 

8* :4R* 

12c:12Rc 

17a:17Ra 

21a :21Ra 

3b :3Lb 

8<^ i8R</> 


17b :17Rb 

21b:21Rb 

3c :3Lc 

8a :8Ra 

13<^ rl7R<^ 

17c :17Rc 

21c:21Rc 


8b :8Rb 

13a :14Ra 


21e:21Re 

4<f, :8R<#> 

8c ;8Rc 

13b:17Rb 

18= ;18P= 

21x:21R<^ 

4a :8Ra 


13c :17Rc 

18*:18L* 


4b :5Rb 

9.^;12R<^ 

13e:17Re 

18<^:18L.^ 

22.;. ;23Ra 

4c :8Rc 

9a :10Ra 

13x :17Rx 

18a :18La 

23.^ :24Rc 

4e :8Re 

9b:12Rb 


18b :18Lb 

24<^ :25Ra 


9c :12Rc 

14= :18L= 

18c:18Lc 

25<f> :26L= 

5*:4R* 

9e:12Re 

14*:13R^ 

18e:18Le 


5a :8Ra 


14a:17Ra 

18x:18La 

26= :2R= 

5b :6Rb 

10=:1L= 

14b:17Rb 


26* :26L<l> 

5c :8Rc 

10*:1L* 

14c:15Rc 

19= :20R= 

264> :26L<t, 


10a :12Ra 


19*:19L* 

26a :26La 


10b:12Rb 


19,#.;19L</, 

26b:26Lb 




19a :19La 

26c :26Lc 




19b:19Lb 

26e:26Le 


The concept of studying cells algorithmically 
was probably implied in von Neumann's^*^ work 
on self-reproduction of structures composed of 
finite automata. This model has been exten- 
sively analyzed and extended by Burks^ and 
Moore.2‘> The growth and stability of autom- 
aton-like arrays is considered in Lofgren,^^ 
Ulam,^'^ Eden,^^ Blum^ and others. Rashevsky'^'^ 


suggested the application of the Markov “Nor- 
mal Algorithm"' (A. A. Markov^'*) to the genet'C 
codes, but did not define any complete cell 
model. Pattee*^^ proposed that a simple autom- 
aton could produce long biological chains of 
a repetitive kind. Induction and repression 
mechanisms in cells are analyzed by Jacob and 
Monod.2« Sugita^'^ and Rosen^^^ explore the 
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Boolean logical representation of cells. Turing 
himself wrote a paper^^ entitled "‘The Chemical 
Basis of Morphogenesis,” but used differential 
equations rather than algorithm theory in this 
study. Soviet workers such as Frank^^ and 
Pasynskiy®^ discuss some general aspects of 
cellular control theory. Lyapunov^^ proposes 
the systematic study of interacting automata 
subjected to executive hierarchies of control 
and Medvedev^'^ deals with errors in genetic 
coding. 

An ‘'algorithmic cell” is defined in this report 
as a system of string-processing finite automata, 
representing enzymes, together with the cell 
contents, identified as strings. Smaller bio- 
chemicals which are not as obviously strings as 
DNA or proteins may be coded as strings. This 
is done routinely in ordinary chemical nomen- 
clature. Furthermore, the linearly-coded en- 
zymes (proteins) must be able to recognize any 
normal biochemical in the cell, and this may be 
interpreted to mean that some sort of a string 
coding of biochemicals is possible. Biochem- 
ists represent DNA, RNA and proteins in an 
associative symbolic notation, but this fact has 
not been subjected to mathematical interpre- 
tation. That is, molecular biologists now assume 
that the four DNA bases, the four RNA bases 
and the 20 or so pertinent amino acids can and 
should be presented as symbols in an associa- 
tive (parentheses free) linear string notation. 
Much progress has been made recently in show- 
ing the details of the nucleic acid to protein 
coding (Nirenberg^D. It is noteworthy that a 
normal algorithm is implied in the DNA triplet 
to amino acid substitution coding, e.g., CAC 
■^Pro. (cytosine-adenine-cytosine codes the^ 
amino acid proline). Although the DNA to pro- 
tein coding is understood, almost nothing is 
known about the grammer, syntactical relation- 
ships or programming techniques used by the 
cell during its strictly deterministic, and there- 
fore algorithmic, growth, differentiation and 
functioning. 

Figure 1.1 is a flow chart of string syntheses 
taking place in a typical cell model of the type 
suggested in this report. At present 43 separate 
threshold algorithmic enzymes are used and 
produce a total of nine different final products. 
The complete model cell consists of the enzyme 


operators stored on magnetic tape, a 2000 sym- 
bol memory which has a place for each string 
and its numbers at any moment of operation, 
and three “housekeeping programs” (written 
entirely in quintuplets representing specialized 
finite automata) which perform addition, gen- 
eration of pseudo-random variables, totalling 
operations, counting, etc. While this type of 
programming is obviously not as efficient as 
a conventional computer methods for straight- 
forward arithmetic operations, it is a consistent 
automaton simulation methodology and very 
flexible. For example, total cell size may be 
controlled by simply introducing a special au- 
tomaton that changes rates of “diet” letter entry 
as a function of total cell string count. A com- 
plete growth experiment with an algorithmic 
cell requires about 30-45 minutes and involves 
hundreds of thousands of individual enzyme 
steps. 

The algorithmic enzyme used in the cell 
model is more complex than the one defined by 
Table II and functions as follows. Each enzyme 
is stored as a program of about 1000 quin- 
tuplets on magnetic tape and called into the 
core memory by the compiler program when a 
preceding enzyme or a “housekeeping automa- 
ton” writes in a “call number” in a specific 
region of the memory tape. As the first step, 
threshold checks are made of “energy” repre- 
sented as a simple string and of one or two 
control strings, by interrogation of the binary 
numbers associated with these strings. If these 
thresholds are met it is next ascertained that 
the substrates (input materials) for the specific 
enzyme are available in sufficiently high quan- 
tities and that the final product is not already 
present in excessively large amounts. If these 
added threshold conditions are met, the enzyme 
functions, producing a fixed quantity of its 
product, while removing appropriate quantities 
of the materials that went into the latter. After 
this is done the algorithmic enzyme writes in 
another “call number” which controls which 
housekeeping program or other enzyme shall 
function next. 

In Figure 1 the enzymes are identified by 
number, as #113, and the “k” value indicates 
synthetic rate for a unit cycle, while the strings 
along the flow lines are the control substances. 
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Figure 1.1. Main String Syntheses (each pathway is shown with algorithmic enzyme call number, rate and 

control substances). 


A complete set of specifications for the enzymes 
shown in Figure 1 will be published elsewhere. 
It can be seen from Figure 1 that certain “in- 
ducer strings,” such as AHA, HHH, etc., control 
the action of sets of coordinated reactions. 
Moreover, synthesis of these “control strings” 
is subjected to induction or repression as a 
function of certain special strings that may be 
supplied from the outside, as by an adjacent 
model cell, or manufactured in the cell if levels 
of “metabolite strings” meet certain arbitrary 
thresholds. A reasonable correspondence can be 
drawn between the activities of the algorithmic 
cell model and known basic cell activities, in- 
cluding blocking and unblocking of DNA, for- 
mation of messenger RNA, synthesis of enzymes 
and proteins on ribosomes, production of metab- 
olites by enzymes, etc. 


The flow chart of Figure 1 is entirely arbi- 
trary and does not attempt to represent any 
actual cell. It must be noted that a real cell 
may have tens of thousands of enzymes and 
genes, and that firm quantitative data is avail- 
able at present only for limited sets of enzy- 
matic reactions. Nevertheless, one may study 
the basic mathematical problems presented by 
string-synthesizing automata. The final prod- 
ucts of the algorithmic cell model, as shown in 
Figure 1 were designed to combine into several 
different two-dimensional arrays. An example 
of a unit in such an array is given in Figure 2. 
A “complementation algorithm” is applied to 
the product strings. This states that any two 
strings will stick together or polymerize pro- 
viding that they have at least three comple- 
mentary bonds, such as A with B, B with A, 
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Figure 1.2. “Diet,” “Energy,” and “Control” Substances. 


C with D, etc. Presence of a non-complementary 
bond negates two complementary bonds. This 
may be considered to be still another kind of 
discrete threshold action that generates com- 
plex entities from simplier units. 

The control strings are so chosen that they 
regulate the coordinated formation of one or 
more final products that polymerize together. 
There are sufficient product strings so that dif- 
ferent types of arrays are possible and may lead 
to long individual chains, cross-linked chains, 
circumscribed blocks of strings and even grids 
with a special border that limits further growth. 
All the operations leading to these results are 
defined by firm algorithms. However, pseudo- 
random variables are added to generation rates 
and this results in less “mechanical” action. 


The short string 
prevents regular 
cross-linking with 
product V, by fill- 
ing in open space. 

V 


See comments 
with unit VII' 


This small product 
can fit into (I+II) 
and prevent any 
further polymer- 
ization. 


This unit will stop 
any further cross- 
linking (single F-F 
tie too weak). 



Products 1 and II are induced 
by control substance CC 
and combine into a three 
piece unit. 


Strings III and IV are made 
in coordinated manner if 
BD is present. They can com- 
bine with (I+II) to give 
a long chain polymer. 


This cross-linking 
unit is made if sub- 
stance CCC is present. 

It can combine units 
from (I+II+^I+IV}j to 
give two-dimensional grid 
pattern. 


Units VI and VII are in- 
duced by substance AHAH 
and can combine with the 
(I+II) unit to give a 
long chain pattern. Cross- 
linking is possible but the 
AHAH control pathway does 
not result in production 
of cross-link unit V. 


COMPLEMENTATION ALGORITHM: Any two strings which collide are assumed 
to link or ’\)olymerize** if three or more complementary bonds are pre- 
sent (A-B,B-A,C-D, etc,). A non-complementary bond negates two 
complementary ones, A single collision is assumed to test all bonds. 


Figure 2. Two-Dimensional Configurations Associated with String Products of the Algorithmic Cell Model. 
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EXPERIMENTAL STUDIES WITH THE 
ALGORITHMIC CELL 

In a typical run the cell model reaches equi- 
librium after about some 40-50 passes through 
the sequence of algorithmic enzymes, whose 
activity or inactivity is determined by induction 
and repression conditions. During the run 
printouts of quantities of each string are ob- 
tained periodically, as on every second or fourth 
unit pass. Cycles of operation, totals of strings 
of given sizes, grand totals and total use of 
energy are also recorded and used for assessing 
the efficiency of a given cell control system. 

Among the specific experiments that will be 
performed with the cell model are included 1) 
curtailing or completely limiting input sub- 
stances; 2) introducing a '‘leakage'' of one or 
more substances; 3) limiting energy use or 
“suffocating" the cell with unremoved products ; 
4) suddenly eliminating (as if by surgery or 
trauma) the entire contents of certain reser- 
voirs; 5) adding large random variables to all 
the generation rates and watching to see if the 
system will stabilize or oscillate; 6) driving the 
system with periodic inputs of elementary 
“diet" letters and noting the transfer functions 
to other parts of the system; 7) “heating" or 
“cooling" the cell, by appling a simple constant 
multiplication factor to rates (k values) of all 
enzymes and then checking stability. Other 
tests are also possible. 

Another experimental study is directed to- 
wards finding out approximately what fraction 
of random changes in the quintuplet commands 
shall lead to a final cell system that “grows out 
of control," like cancer. With about 45,000 
quintuplets, each capable of many random 
“mutations," exhaustive testing is not practical 
even with this small system. However, it is 
possible to make random changes in representa- 
tive quintuplets of the more important enzymes 
and determine whether they cause the algorith- 
mic cell to die, shrink, grow wildly, interact 
abnormally with adjacent cells (that is, fail to 
differentiate properly), stop producing one or 
more of the required final products, etc. All 
these examples have clear-cut biological 
counterparts. 

Experiments are also planned on competition 
between two or more algorithmic cells, which 


are subjected to random mutations in selected 
enzymes. This is done by having the cells draw 
upon a common diet and share a pool of some, 
but not all, strings. It is then possible to model 
experimentally a kind of simulated natural 
selection, in which the cell that first reaches 
final “adult" size is selected over lagging ones. 

Self-reproduction, as such, can be modelled 
with the above system, and would involve hav- 
ing the algorithmic enzyme quintuplets listed on 
a long “gene" string, which is copied at time of 
reproduction. Self-reproduction becomes pos- 
sible when the length of the “gene" coding is 
sufficient to define a certain minimum number 
of operational enzymes which account for cell 
structure, basic metabolism, gene copying and 
over-all control of the entire simulated cell sys- 
tem. This type of self-reproduction differs in 
certain fundamental respects, i.e. exclusive use 
of strings from the “kinematic" or “grid" re- 
production considered by von Neumann, 
Burks^ and Moore,^^ and also does not resemble 
very closely the geometric growth systems of 
Ulam^^ or Eden.^^ The details of an existence 
demonstration for self-reproduction in an al- 
gorithmic cell model shall be considered else- 
where. 

As was noted above the final string products 
of the cell model combine or polymerize to form 
rather intricate two-dimensional arrays, some- 
what reminiscent of known protein polymers 
such as collagen or muscle fibers. Following 
polymerization, which at present is modelled by 
hand but could be represented on a digitally- 
controlled projection screen, certain specific 
sequences of letters may be read off around the 
margins of the two-dimensional arrays which 
may be chains, cross-linked chains, grids or 
irregular crystal-like structures resembling 
those described in Ulam.^^ Future studies will 
make use of algorithmic pattern recognition 
strategies to determine when an array has 
reached its final desired configuration and also 
what may be done to repair such an array if 
it has been injured, i.e., a corner chopped off 
a grid. In the biological world there is a clear 
counterpart to these abstract studies in the 
ability of cells and tissues to regenerate missing 
parts using strictly determinate procedures 
which are clearly coded in their gene sequences. 
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The described system is a sequential, not 
parallel, simulator. In the real cell it is known 
that tens of thousands, if not millions, of copy- 
ing and synthetic activities may proceed in 
parallel. It is possible to mimic the effects of 
many identical enzymes acting simultaneously 
by increasing rates of production, and to mini- 
mize real-time non-simultaneity problems by 
using a rather short unit time period, during 
which no one synthetic chain is produced in 
large quantity. However, future studies will 
explore various ways of simulating a parallel 
string-processing system. 

As noted, the above system does not attempt 
to model any real cell or the properties of real 
enzymes in a physiochemical sense. Recent 
studies of enzyme action (Labouesse et al .,21 
Monod et al.^®) have shown that they are 
formidable objects from the physiochemical 
standpoint, because they are composed of hun- 
dreds of individual amino acids and have a 
complex three-dimensional structure. They are 
now believed to function, at least in part, 
through physical dislocations of certain enzyme 
regions after a given substance becomes at- 
tached to a receptor location. One must keep in 
mind that these remarkable logical machines, 
which can be characterized as “coiled up^^ or 
“multiply connected’' strings, are determined 
solely by the linear array of their amino acids, 
which are in turn coded by the gene base 
sequences. Physiochemical models of this “con- 
formation" system have not been very satisfy- 
ing. 

Possibly the central question is choice of type 
of automaton to simulate an enzyme: it might 
use numbers, the Boolean variables 1 and 0, or 
letter strings, as discussed above. The differen- 
tial equations models can be subsumed under 
analog-type systems that process real numbers 
(concentrations and rates). Sugita^*^ and 
Rosen^^^ suggest a Boolean logical model of 
cellular biochemical mechanisms. Arbib^ and 
others have demonstrated the algorithmic 
equivalence of the Turing Machine, finite au- 
tomaton and McCulloch-Pitts net, so the Boo- 
lean method and general string approach are 
theoretically equivalent. However, it appears 
that string-processing methods are more appro- 
priate for contemporary molecular biology, 


which already uses biochemical strings (nucleic 
acids and amino acids in proteins) as a basic 
tool. 

ALGORITHMICALLY UNSOL VABLE 
PROBLEMS FOR AN ALGORITHMIC CELL 

One of the most interesting conclusions which 
can be drawn from the above model is that a 
cell composed of string-processing enzymes, 
coded by strings of genetic symbols, has definite 
limitations on what it can do, i.e., there are 
many plausible situations for which it would 
have no adaptive algorithm, or “definite proce- 
dure," to use Turing's own term.^^ 

One has to consider what kind of logical 
problems may be solved by a well-defined, cir- 
cumscribed system of finite automata of the 
above type, i.e., ones which may copy strings, 
synthesize or lyse strings while obeying a 
principle of “conservation of letters" (with the 
exception of some random variables imposed on 
reaction rates), and also perform certain types 
of blocking and unblocking actions, simulating 
cellular induction and repression. In addition 
the enzymes may be combined into structures 
that have barrier properties, i.e., which allow a 
string to pass from one region to another. En- 
zymes may also be linked into fairly com^plex 
subassemblies. Insofar as is known today, the 
cell does not contain specialized molecular au- 
tomata that perform binary logic, arithmetic, 
compute or “think" in any ordinary sense. 

A system of 43 algorithmic enzymes has 
logical abilities that are qualitatively far more 
complex than those of an individual enzyme, but 
still circumscribed. Consider, for example, what 
happens when an algorithmic enzyme contained 
in a virus attacks an algorithmic cell. This 
virus enzyme may lyse an important string of 
the cell, or use normal biochemical strings in the 
cell for its own purposes. The original cell may 
or may not be able to resynthesize the lysed 
products or to produce enough of them to over- 
come its losses. It may or not contain any pre- 
coded lytic enzymes that can dissolve the en- 
zymes or other constituents of the virus. 

What is much more basic, perhaps, is that 
there may exist no algorithm by which the cell 
system can identify a virus on the basis of its 
genetic or protein sequences, or from its initial 
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effects on itself, which may result in new prod- 
ucts that the cell is unable to recognize. If a 
cell attempted to ‘‘compute'’ the string composi- 
tion of a new enzyme (there is no evidence that 
cells do this) designed specifically to attack a 
virus, it would also have to solve the following 
problems, which have well-defined algorith- 
mically unsolvable counterparts in Turing Ma- 
chine theory (see Da vis, Trakhtenbrot^®) : 1) 
will a new proteolytic enzyme (all enzymes are 
known to be proteins) lyse other desirable pro- 
teins of the parent cell (the “applicability prob- 
lem" of a Turing Machine) ; 2) will a new pro- 
teolytic enzyme lyse itself (the ‘'self-applicabil- 
ity problem" of a Turing code applied to itself 
— see Trakhtenbrot^^) ; 3) will a new enzyme 
result in a stopping or equilibrium condition of 
the whole cell (the “stopping problem" of a 
Turing Machine — see Davis^^) ; 4) will a new 
enzyme give a final cell configuration which is 
compatible with all stages of coordinated 
growth and development (“translatability prob- 
lem," or predicting final cell configuration 
when given an earlier one) ; 5) is the “com- 
puted" enzyme the most efficient one (this re- 
lates to the algorithmically unsolvable problem 
of minimality of a Turing Machine coding — see 
McNaughton^®) ; 6) will the system with the 
new enzyme be stable and resist instability 
during certain possible cellular situations. 

Quite similar problems arise if a cell attempts 
to distinguish a cancer cell from a normal cell, 
so as to be able to kill it. It is pertinent to ask 
if the normal defensive cells perform a kind of 
Turing's Test^^ on potentially cancerous ones, 
acting in some abnormal way. 

As a specific example, suppose a hypothetical 
cellular automaton were engaged in design of 
an enzyme of about 150 amino acids (given in 
associative notation, as Asp-Lys-Glu-Lys . . .) 
which could attack a virus. One would have to 
know how this enzyme would function during 
all past and future stages of growth and dif- 
ferentiation. Information about the latter is 
present in the genes, but it is not clear how the 
cell could get access to the desired information 
without a “dump" from gene memory, which 
might result in loss of recursive control. In 
effect, the cell would have to model its own 
condition during all possible earlier and later 
states of differentiation. Moreover, the cell is 


unable to predict what sort of other biochemi- 
cals (or viruses) might be present in the envi- 
ronment at later times or the effects of the new 
potential enzyme on its status in natural selec- 
tion against other cells. Furthermore, the 
postulated cellular automaton would have to 
anticipate the final three-dimensional structure 
of an enzyme, with its “conformation" (physical 
folding) properties in abstracto, since once a 
new enzyme was actually made it might lyse 
the automaton in question or otherwise inter- 
fere with normal cell controls. 

Reasoning of the above type would suggest 
that acquired characteristics are not inherited 
because in the general case a cell attempting to 
carry out the indicated adaptation is faced with 
algorithmically unsolvable problems. If direct 
genetic adaptation were possible it would confer 
a tremendous advantage and basic reasons 
why it does not occur should be sought. How- 
ever, the problem is complicated by the fact 
that real cells are much more complex than the 
abstract model of this report and is discussed 
further in another report (StahD^). 

It must be noted that the described model 
does not adequately simulate either the parallel 
string processing or real-time processing prob- 
lems in a cell, in which a given algorithm must 
be completed in approximate time synchronism 
with existing developmental algorithms. 
McNaughton,2® Rabin and Scott, Burks, ^ and 
others have considered some aspects of com- 
putability for “growing automata" and real- 
time computability problems. The deliberate 
design of new algorithmic enzymes might also 
involve certain questions of solvability for 
multiple-tape Turing Machines, the isomor- 
phisms (Sorkin'*^) and composition of automata 
(Glushkov^®), one-way or back-and-forth read- 
ing machines (Schutzenberger^D, state identi- 
fication experiments on existing molecular au- 
tomata (Gill,^4 Ginsbergis), and adaptive abili- 
ties of automata (Tsetlin^®). 

Algorithmic unsolvability may be referred to 
Turing Machine theory or to general recursive 
computation theory. If the latter methodology 
is used one would presumably reach the con- 
clusion that no decision procedure existed for 
the choice of amino acids (or gene bases) for a 
given adaptive problem. In the case of a malig- 
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nant cell one might say that there was a faulty 
“transfer of control/' or lack of a “stop condi- 
tion/' This type of statement is considered 
wholly acceptable to cancer specialists at pres- 
ent and is implied in contemporary discussions 
of the cancer problem. Turing Machine un- 
solvability has the important advantage of 
defining several qualitatively distinct unsolv- 
ability situations, such as the “stopping prob- 
lem," “translatability problem," “self-applica- 
bility problem," “minimality problem," etc. 

The algorithmic cell differs from “learning 
models" (see, for example, Pask®^) and certain 
of the systems described under the term “self- 
organizing systems" (Yovits and Cameron^^) 
because, as contrasted with the latter and the 
brain, it cannot “learn" its environment, and 
only responds to it selectively with available 
precoded algorithms. Interesting problems arise 
in the study of neurons represented as string- 
processing systems. Adaptation by the brain 
does not, of course, imply adaptation by the 
genes in neurons. Algorithmically unsolvable 
problems would clearly arise if “new" brain 
circuits were to be designed by a hypothetical 
neuronal molecular automaton. For example, 
Hennie^'^ has stated a number of unsolvable 
problems connected with iterative nets, which 
may be composed of non-specific threshold 
neurons. 

To state an example, it appears plausible that 
the deliberated design of a new integrative cir- 
cuit by a frog brain, for the purpose of detection 
of some new specific type of a bug projected on 
its retina and based on an iterative type of cir- 
cuit, might involve algorithmically unsolvable 
problems. Pursuing a related line of reasoning, 
Cannonito® has discussed Godel's incomplete- 
ness theorem as applied to intelligent machines. 

DISCUSSION AND CONCLUSIONS 

Careful study of the existing literature in 
both molecular biology and finite automata 
theory, including also recent Soviet reports, 
suggests that the model proposed in this report 
has not been used before. At present it is in a 
developmental stage, and is running as de- 
scribed on an SDS-920 computer. 

It has been noted above that the model does 
not attempt to use numerical data from bio- 


chemical studies. However, it will be tested in 
this capacity, with use of finite approximations 
to enzyme kinetics. Quantitative modelling will 
become really meaningful when a sufficiently 
complete and accurate set of parameters is 
available for an actual cell. Most existing quan- 
titative biochemical models are of the steady- 
state compartment type and demonstrate 
changes of equilibria in open systems. The pres- 
ent model is designed, on the other hand, to 
study coordinated growth and progressive dif- 
ferentiation of a cell. Both types of studies 
obviously have a place. 

Since real cells are so complex, and may in- 
clude thousands of different enzymes present in 
many copies, there arises the very real question 
of what kind of mathematical or algorithmic 
tool is best suited for the simulation of enzymes 
and cells. In this report it is suggested that 
finite automata are a suitable axiomatic tool. 
While the proposed model is imperfect, it does 
allow a new and mathematically consistent 
representation of such extremely characteristic 
biological phenomena as differentiation, induc- 
tion and repression controlled by environmental 
biochemicals, coordinated grov/th, development 
of cancers, competition between cells, etc. The 
need for some new mathematical attack on the 
problem has been strongly stressed by such 
senior biologists as Waddington.^^ 

As concerns the Turing quintuplet pro- 
gramming technique, it is more efficient than 
has been supposed and is well suited for study 
of the properties of systems of automata. It is 
effective where intensive logical processing of 
a non-standard variety is to be done on a rather 
small amount of input information, but, ot 
course, extremely inefficient where much infor- 
mation must be used at one time or for con- 
ventional numerical processing. Other reports 
will take up the useful range of application of 
the described automaton simulation program- 
ming technique. 

A model of a general synthetic system may 
also be of interest in realms other than cellular 
biology. For example, it may be proposed that 
in any general production process the addition 
of a letter to a string can represent a “next 
processing step," while joining of smaller 
strings indicates combination of subassemblies 
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or completed parts. Alternatively, each proc- 
essing step may stand for some specific infor- 
mational processing task. Abstract models of 
this type are interesting for studies of para- 
sitism and suggest, for example, that viruses, 
tapeworms and human embezzlers might be 
subsumed under one consistent theory of events 
in “synthetic productive systems.” The analogy 
is loose and speculative, but provocative. 

As another example, it has been known for 
several years that mammals are able to defend 
themselves non-specifically against viruses by 
production of a genetically precoded substance 
known as “inteferon” (Isaacs**). The mode of 
action of this complex biochemical material is 
not clear, but when it is discovered it will be 
interesting to study what algorithm controls its 
release and how it relates to the unsolvability 
problems discussed above. There is also much 
interest in the study of algorithms that may be 
used to distinguish normal from malignant cells. 

Work described in the above is intended as a 
starting point for studies which deal with 
specific problems in the design of cellular con- 
trol algorithms that are stable in the presence 
of environmental fluctuations, coding errors 
(mutations) and can to some extent resist para- 
sitic systems such as viruses or cancer cells. 
Compared with any real cell the model is very 
crude, but it bears much the same relationship 
to real tissue cells as the McCulloch-Pitts 
neuron has to actual brain neurons. The logical 
neuron has been useful as a firm mathematical 
tool for study of the over-all properties of 
brain models and the algorithmic cell can serve 
in a very similar capacity for systems of cells. 
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The modern digital computer has made major 
contributions to sociology in at least two gen- 
eral areas. Because of the computer’s high 
speed and its ability to handle eifectively the 
interrelationships among many and complex 
variables, we are enabled to perform more so- 
phisticated analyses of data already gathered. 
For the first time it is possible to deal statis- 
tically with social systems as systems. Poten- 
tially of even greater benefit is the second con- 
tribution — the use of computers to generate 
data for the purposes of theory testing and 
development. 

It has been traditional in sociology to express 
theory in verbal formulations. Words, of course, 
are weak tools for achieving precise statements 
of propositions, for studying the interrelation- 
ships among variables, and for learning the 
consequences for a social system if it operates 
according to the processes defined or implied 
by the theory. Once the processes of a theory 
are expressed as routines in a computer pro- 
gram, we can simulate the behavior of a social 
system which operates according to the rules 
of the given theory. To the extent that the ob- 
served outcomes of an experiment in a com- 


puter run match the observed outcomes of the 
identical experiment in a real-life system, the 
rules of the theory may be considered a suffi- 
cient (though not a necessary) explanation of 
the behavior. 

Let us turn now to a description of HOMUN- 
CULUS,^’^ an operating program in Informa- 
tion Processing Language, Version designed 
to simulate human interaction in small groups. 
The computer model is based on the theory ad- 
vanced in a recent book, Social Behavior, by 
George C. Homans, which presents an explana- 
tion of elementary social behavior — that is, of 
“face-to-face contact between individuals, in 
which the reward each gets from the behavior 
of the others is relatively direct and immedi- 
ate”.^ Homans’ model is one of social exchange, 
envisaging human behavior as a function of 
its payoff : in amount and kind, an individual’s 
responses depend on the quantity and quality 
of reward and punishment his actions elicit. 
To explain elementary social behavior, Homans 
advances five propositions, based primarily on 
experimental psychology and classical econom- 
ics, dealing with the effects of such factors as 
the frequency and recency of reinforcement, 
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stimulus and response generalization, and rela- 
tive deprivation and satiation. 

To simplify the problems of expressing con- 
cepts concerning human social behavior in a 
computer program, we followed the tactic of 
beginning with a simple interaction sequence 
between two hypothetical individuals. In The 
Dynamics of Bureaucracy, Peter Blau of the 
University of Chicago describes in detail a fed- 
eral civil service office in which the men studied 
held the same title but varied in competence.^ 
As expected, the more skilled workers received 
more requests for assistance from their co- 
workers. While the participants in such con- 
sultations benefited, they still incurred costs in 
their exchange. That is, the agent requesting 
help was usually rewarded by being enabled to 
do a better job; however, he paid the price of 
implicitly admitting his inferiority to a col- 
league who by title was supposedly his equal. 
The consultant, on the other hand, gained pres- 
tige ; nevertheless, he incurred the cost of time 
taken from his own work. 

In our computer model of this social exchange 
we have programmed an interaction sequence 
beginning with an agent, Ted, emitting a sym- 
bol representing a request for assistance to his 
co-worker, George. The flow chart depicted in 
Figures 5 through 6 outlines our interpreta- 
tion of the processes involved in operational- 
izing Homans' propositions for elementary 
social behavior. In our illustration, the pro- 
grammed statement of Homans' propositions 
specifies the symbol-manipulating processes 
which enable our hypothetical consultant, 
George, to decide what action he will emit in 
response to Ted's request. 

As we began to develop flow diagrams for the 
specific routines to represent each proposition, 
we found that the discipline of making decisions 
about how to program a computer to simulate 
social behavior gave us a different perspective 
in our thinking about how humans react in so- 
cial contexts. This new frame of reference 
forced us to make explicit our conception of a 
person as an information-processing system. 
If our model of a person were to behave accord- 
ing to the principles set forth in Homans' ex- 
planatory propositions, the format in which he 
was described in the computer had to be such 


that he could perform at least the following 
activities : He had to be able to receive stimuli, 
recognize stimuli, store stimuli in memory, and 
compare and contrast stimuli ; he had to be able 
to emit activities, differentiate reward and pun- 
ishment, associate a stimulus situation with a 
response, and associate a response with a rein- 
forcement; and, on the basis of past experience, 
he had to be able to predict the consequences of 
each contemplated response. 

In the group setting he had to be able to dis- 
criminate among the members of the group, 
evaluate each social stimulus in terms of the 
specific person emitting it, and select a response 
appropriate to the specific other person in the 
given group. Our model thus pictures a person 
as an active hypothesis-testing organism capa- 
ble of receiving, storing, analyzing, and 
reconstructing information. These human in- 
formation-processing functions bear obvious 
similarity to the digital computer's symbol- 
manipulating capacities. Even so, the process 
of constructing such a social being in the com- 
puter was no easy task. 

Fortunately, IPL-V was designed to handle 
the types of problems we faced. In our model a 
person is represented as a list structure com- 
posed of a hierarchy of list structures, lists, and 
description lists. A simplified presentation of 
the organization of a list structure of a person 
is depicted in Figures 1, 2, and 3. Among the 
data included are lists specifying an individual's 
identity, his general level of ability and his level 
of skill in social interaction and in certain 
specific tasks, his relative and absolute position 



Grm firtif Smp tmp 


I 1 1 1 1 1 

hni* ir NM|( It half K ktiti i4 biilii 

fall* Mf K ftlNr ••• ItlMf iMfli 

iti*tr Makii |Tii| i 


Figure 1. 



COMPUTER SIMULATION OP HUMAN INTERACTION IN SMALL GROUPS 


105 


M 

_L 


Tii K !•«»•< >■***»*■ 

1 hrsM fr«i Ti< *• 


I If TM'S 
•CIS If 


-1 r 


—I 1 

lifNSt litri*«tiii 

fm m U 6fii» 


"T 


Nciift Mwitti 

ti ||Kii«i iMliVli 


Figure 2. 


Responses f hive Made 


Givc^^Mp 

RiRKstei 

1 

1 — 

Referred 
ti Anther 
PersM 

1 — 

Referred 
ti Writtei 

Docineits 


Ciild^ Hit 

Help 

Rejected 
Reqiest as 
Uireasoialle 

Cist II 

Tine 

Cos/ ii 

Eiergy 

1 

Freiueicy 

This Act 

Rewarded 

Strelfth 

Reward 

1 

~l 

Teds 

Resmnses 

J 


Apirovil 

Reqlest 

Firthcr 

Help 

Evade^ 
Recotniziii 
Help Ret d. 

Disapprove 
of Help 
Received 



VMie Appripriateiiss Oeprivatin- Frepwicy 


of Ted s 

of Activity ii 

Satiatiii 

of Ted s 

Approval 

This Caetext 

Score 

Approval 


Figure 3. 


in certain groups, his images of his membership 
and reference groups, and lists containing ele- 
ments of his past history and his resulting val- 
ues and needs. Through exposure to various 
experiences an individual’s list structure be- 
comes modified to include a record of them. 
Thus, at the beginning of a run, he may be rep- 
resented by a list structure of as little as 200 
words. By the end of some fifty or one hundred 
interactions, he may require a thousand or more 
IPL words to describe him. In effect the indi- 
vidual learns from his experiences and keeps in 
memory the information that will enable him to 
behave more appropriately in future inter- 
actions. 

The processes expressed in Homans’ proposi- 
tions determine the behavior of the person- 
models in HOMUNCULUS. Homans assumes 
that his propositions explaining elementary so- 
cial behavior apply generally to all humans, re- 
gardless of their cultural backgrounds and insti- 
tutional relationships. In operationalizing Ho- 
mans’ propositions, therefore, we designed the 


routines to be common for all simulated per- 
sons. However, the routines are so written that 
the information retrieved from the data list 
structures describing individual participants 
determine whether certain subroutines will be 
executed and what their outputs will be in 
specific interaction sequences. 

Propostion 1. To illustrate some of the proc- 
essing involved in our model as well as some of 
the "problems encountered in translating verbal 
statem.ents into computer routines, let us con- 
sider the dynamics of Proposition 1 (PI, Boxes 
IV through XXII in Figures 4 and 5). Ho- 
mans states this explanatory proposition as 
follows : 

If in the recent past the occurrence of a par- 
ticular stimulus-situation has been the occa- 
sion on which a man’s activity has been 
rewarded, then the more similar the present 
stimulus-situation is to the past one, the more 
likely he is to emit the activity, or some simi- 
lar activity, now.® 

Essentially this describes the effects of the psy- 
chological principles of stimulus generalization 
and response generalization. In formalizing 
this proposition as a computer routine, how- 
ever, we found that the apparent clarity of the 
verbal statement actually cloaked certain am- 
biguities. 

What, for example, is a “particular stimulus- 
situation?” In terms of the specific interaction 
sequence we have selected, does it include every 
request for help one has received, or does it 
include only requests from the same person for 
assistance on the identical problem within an 
identical social context? We decided to write 
routines to process in sequence two aspects of 
the “stimulus-situation.” That is, we hypoth- 
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esized that a person receiving a stimulus, 
such as a request for assistance, would first re- 
act in a relatively global manner to the general 
situation itself; then he would consider the 
situation in terms of the specific individual 
emitting the stimulus. Thus, in our example, 
we interpreted the initial information process- 
ing implied by Proposition 1 (see PI, Box IV in 
Figure 4) to involve consideration by our office 
worker, George, of whether the activity received 
— in this instance, a request for help — is a gen- 
eral stimulus-situation in which his responses 
have been rewarded. In executing this process, 
a subroutine representing one of George's in- 
formation-retrieval capacities searches a mem- 
mory list of reinforced stimulus-situations to 
determine whether the present input is among 
them. 

Let us take the positive branch of the flow 
diagram and thus assume that George has found 
that in the past he has been reinforced for re- 
sponding to a request for assistance. His next 
consideration is outlined in Box X of the flow 
chart in Figure 5. Now George must determine 
whether his responses to a request for help have 
been rewarded by Ted, the person currently 
introducing the stimulus-situation. 

In checking on past interactions with Ted, 
George must search deeper into his memory 
structure. At this point one subroutine locates 
George's image list of office colleagues, finds 
the sublist describing Ted, determines whether 
Ted has emitted the present stimulus to George 
previously and if so whether Ted has generally 


rewarded George's responses to such requests. 
If he thus determines that Ted has been an 
agent of reinforcement in past interactions in- 
volving requests for assistance, then George 
proceeds to retrieve £he types of responses he 
has emitted to Ted. In our program George se- 
lects up to three activities from a memory list 
of responses Ted has rewarded (Box XXII, Fig- 
gure 5) ; then he proceeds to process further 
information regarding these contemplated ac- 
tions in order to select a response which he 
anticipates will net him a social profit. 

It is apparent that we have adopted as a 
tactic the policy of including, as a group, stim- 
uli which resemble each other but are not nec- 
essarily identical. This means, however, that 
our programmed people cannot discriminate 
degrees of similarity. No great difficulties arise, 
of course, in constructing routines to test grada- 
tions of similarity as called for by Homans' 
proposition. When we do include a test of 
similarity, it will require that we be more pre- 
cise than Homans in specifying what we mean 
by “more" and by “similar." To develop this 
discrimination we plan to build a list of attri- 
butes describing various activities. One such 
attribute would have scaled values indicating 
the expenditure of energy required to emit an 
activity ; another would have a range of values 
denoting the amount of time required to com- 
plete the given activity. Once these description 
lists for activities have been completed, a rou- 
tine can be written to compare activities, deter- 
mining the number of attributes two activities 
have in common as well as the degree of simi- 
larity in the values of their attributes. The 
criteria for judgment of similarity can be set 
as fine or as broad as desired. Our interest will 
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be the development of routines which simulate 
human decision processes as closely as possible. 

Up to this point in our discussion of Proposi- 
tion 1 we have considered only the positive 
branches — that is, what happens if George finds 
that the general stimulus-situation has been a 
rewarding occasion and that Ted has been an 
agent of reinforcement. But one may note that 
most of the items included in the flow diagram 
in Figures 4 and 5 outline processes accom- 
modating the negative branches. For example, 
if George finds that in the past his responses to 
a request for help generally have not been re- 
inforced (Box IV, Figure 4), he would still 
wish to ascertain whether he has interacted 
with Ted before (Box V) and if so, whether 
such interactions have proved rewarding (Box 

VII) . If past interactions with Ted have had 
positive consequences, George may then select 
a response Ted has previously reinforced to try 
in this stimulus-situation (Box IX). Otherwise, 
George may decide to withdraw from what is 
likely to be another unrewarding situation (Box 

VIII) . Even if he has not interacted with Ted 
previously (Box V), if George is a gregarious, 
other-directed American, he may still wish to 
continue the interaction ; therefore, he can se- 
lect some person similar to Ted (in terms of 
such attributes as sex, age, relative position in 
the group, etc.) to use as a reference source for 
determining appropriate action (Box VI), 

Let us now consider the processing involved 
in the negative branch for the inquiry concern- 
ing the social context of the stimulus-situation. 
To be at this point in the program George would 
have ascertained that, in the past, responding 
to a request for help had generally proved re- 
warding (Box IV, Figure 4) ; however, he 
would have found, in considering the request in 
terms of the current stimulus person, that his 
responses had not been rewarded by Ted (Box 
X, Figure 5). A relevant question then is 
whether George previously has received this 
stimulus from Ted (Box XI). If he has not, 
then George will ascertain whether he really 
has interacted with Ted before (Box XII) ; if 
not, he will select a colleague similar to Ted to 
use as a reference source in processing Ted's 
request (Box XIII). To indicate the possibility 
that George is using a surrogate for the stim- 


ulus person in making decisions, references to 
Ted appear in parentheses. 

If George finds that Ted has previously re- 
quested his assistance (Box XI), then the re- 
sponses he made to Ted’s request should be 
avoided since they did not lead to reinforcement 
(Box XIV). Indeed, in such a case George will 
wish to evaluate his relationship with Ted to 
determine whether interacting with him is 
worthwhile (Box XV). If it is not, he may end 
the interaction sequence then and there (Box 
XVI). *On the other hand, if George has en- 
joyed contact with Ted (Box XV), he might 
try to recall whether he has observed Ted’s ask- 
ing other co-workers for assistance (Box XVII) 
and, if so, what their reactions were (Box 
XIX). If he lacks this knowledge of group 
activities (Box XVII), George still can select 
rewarded responses to requests for help from 
some other colleagues (Box XVIII). In either 
case, George will then ascertain whether the 
responses suggested are of the same type as 
those he emitted previously without receiving 
reinforcement from Ted (Box XX). If they 
are, George has little choice but to withdraw 
from what would probably be an unrewarding 
situation ; otherwise, he can try again to elicit 
a positive reaction from. Ted by emitting one of 
the untried responses (Box XXI). 

Proposition 2. Homans’ second proposition 
deals with the positive influence of the fre- 
quency and recency of reinforcement : 

The more often within a given period of 
time a man’s activity rewards the activity 
of another, the more often the other will 
emit the activity.^ 

Reformulating this proposition for computer 
simulation posed a number of problems. It 
would have been relatively simple merely to set 
a counter for each reinforced response and then 
retrieve the desired information regarding re- 
ward frequency. However, we felt this proce- 
dure would not adequately simulate human in- 
formation-processing systems. Of course, peo- 
ple do avail themselves of precise measurement 
scales and use various cultural artifacts — such 
as computers — to increase their accuracy. But 
in making estimates concerning frequencies 
and values of rewards ensuing from everyday 
social interaction, people seem to use a less re- 
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fined means of measurement. In programming 
this proposition, therefore, we devised a rather 
crude five-point ordinal scale for reward fre- 
quency, ranging from an estimate that a re- 
sponse was ‘‘nearly always rewarded/' through 
a judgment that it was “rewarded about half 
the time," to an assessment that was “almost 
never rewarded." 

At present we are experimenting with differ- 
ent means of manipulating this scale. One rou- 
tine we have written increases the ordinal scale 
value for the reward frequency after three re- 
inforcements of the response. This procedure, 
however, is not completely satisfactory. Indeed, 
one may argue that estimates of reward fre- 
quency are not necessarily independent of the 
emotional salience of the reinforcement. When 
HOMUNCULUS has reached the stage of simu- 
lating small-group behavior in controlled condi- 
tions, it should be possible to test various ap- 
proximations of human judgments of reward 
frequencies from social interaction and to se- 
lect the routines which simulate the actual be- 
havior most accurately. 

When the processing for this proposition is 
completed (P2, Box XXIII, Figure 6), George 
has a rough estimate of the frequency with 
which Ted has rewarded each of the activities 
he is considering in response to Ted's current 
request for help. Homans' Proposition 2, taken 
alone, would lead to the expectation that George 
would then merely emit the most frequently re- 
warded response alternative. But other infor- 
mation must be processed before a decision is 
reached. 

Perhaps here we should indicate how the 
program keeps all this material in immediate 
memory for George. Up to one hundred named 
private storage cells are assigned for this pur- 
pose, and instructions in each routine specify 
which cells it is to use for storing its findings. 
At present, George is using about fifty of these 
cells. In addition, important information avail- 
able to all group participants — for example, 
what could be seen and heard during the last 
five interactions — is kept in named public stor- 
age cells. 

Proposition S. Among the other relevant fac- 
tors that must be considered in selecting an 
activity to emit is the value of the anticipated 


reward. Homan's third general proposition 
states, 

The more valuable to a man a unit of the 
activity another gives him, the more often 
he will emit activity rewarded by the ac- 
tivity of the other.^ 

Assessing the value of an activity is some- 
what more complicated than estimating the fre- 
quency with which it occurs. Value has two 
components — one relatively constant, and the 
other, which we shall discuss in Proposition 4, 
relatively variable for the periods of time in- 
volved in the simple interactions comprising 
elementary social behavior. The value compo- 
nent referred to in Proposition 3 concerns an 
individual's rank-ordering of the subjective re- 
ward attendant on receiving one activity rather 
than another. With reference to our example, 
we might predict that George would find warm 
social approval involving Ted's complimenting 
him in front of colleagues to be more “valuable" 
than a half-hearted response of “Hmm, thanks," 
or an annoyed retort, “Well, sorry I bothered 
you." 

At this point in our program, therefore, we 
have what game programmers term a “look- 
ahead." In considering Ted's request, George 
has “in mind" three responses he recalls Ted's 
having rewarded in the past, and he has esti- 
mated the frequency with which Ted has rein- 
forced each response. Now he must consider 
more carefully the particular reward he expects 
Ted to give to each response so that he may de- 
termine the inherent worth of each anticipated 
reward (P3, Box XXIV, Figure 6). Taking 
in turn each activity George is contemplating, 
the routines executing this proposition retrieve 
the responses Ted has previously made to each, 
determine which one he is likely to emit now, 
and search description lists to find the subjec- 
tive value of the reward for George. 

Proposition -4. Homans' fourth proposition 
deals with the other component of value — the 
deprivation-satiation aspect, or the marginal 
utility of a given unit of activity. 

The more often a man has in the recent 
past received a rewarding activity from 
another, the less valuable any further unit 
of that activity becomes to him.® 
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In contrast to the relatively constant intrinsic 
satisfaction aspect of value, the deprivation- 
satiation component varies over a range of 
possible rankings. Taking into account the 
amount of an activity a person has received, we 
note that he “values'" that activity more when 
he has been deprived of it than he does when he 
is in a state of relative gratification. Thus, 
while social approval may be highly rewarding 
to an individual, if in the recent past he has 
received a great deal of this generalized rein- 
forcer, then he is not likely to be so interested 
at the moment in receiving more. 

In processing the information necessary for 
completion of this stage of the program, George 
must evaluate his relative deprivation with ref- 
erence to the rewards he anticipates from Ted. 
George now has in immediately available mem- 
ory a record of each activity he is contemplating 
and stored with each activity is various infor- 
mation about it, including the response he ex- 
pects Ted to make to it. The routines which 
execute Proposition 4 search the description 
lists of each of the anticipated rewards to deter- 
mine the degree of George’s current deprivation 
or satiation with respect to them. A depriva- 
tion-satiation score based on a simple ordinal 
scale is stared as the value of a special attribute 
on the description list of each activity. In ex- 
ecuting Proposition 5, which we shall discuss 
later, routines update the deprivation-satiation 
score whenever an activity is received. 

With the information retrieved thus far 
George has an estimate of the relative frequency 
with which Ted has rewarded each activity he 
is considering emitting. Furthermore, he has 
predicted Ted’s reaction to each of the pro- 
jected actions and has determined how reward- 
ing each of these anticipated reactions is to 
him, personally, as well as how deprived or 
satiated he currently feels with respect to each 
of these expected rewards. At this point, 
therefore, George can rank his contemplated 
responses in terms of their expected payoff. 
But he is not yet ready to emit the highest 
ranked action. 

Another important consideration is the cost 
of the proposed response. Homans defines the 
cost of an activity as the value of the reward 
obtainable through an alternative activity fore- 


gone in emitting the given one. In our example, 
George must forego working on his own assign- 
ment if he takes time to assist Ted; therefore 
George must determine the relative reward 
value of this alternative activity. To do this he 
follows a procedure analogous to that just de- 
scribed, processing information concerning the 
frequency of past reinforcement and the value 
of the anticipated reward ensuing from this 
activity as well as his relative satiation with 
the reward. Then he can compare the overall 
expected reward from his contemplated re- 
sponse to Ted with the anticipated reward from 
continuing with his own work, and he can com- 
pute what Homans terms the psychic profit — 
the reward of an activity less its cost. 

Let us suppose George is tentatively plan- 
ning to give Ted direct assistance on his prob- 
lem because in the past Ted has praised him for 
this activity, and social approval is a reinforce- 
ment George values highly and one for which he 
feels relative deprivation at present. But let us 
also suppose that George has an important as- 
signment to complete,, and that taking time 
from it might detract from the quality of his 
work and thus lessen the approval he anticipates 
from his boss for a good job. In this case 
George would incur a loss rather than a profit 
in helping Ted directly ; therefore, he will con- 
tinue processing to see whether one of the other 
activities he was contemplating might yield a 
profit. In this illustration, George will proba- 
bly decide that referring Ted to another source 
will net him a profit, since he expects some 
approval for this activity (albeit less than he 
would get from directly assisting Ted), and he 
will incur a very small cost in terms of time 
taken from his own work. 

Having selected what he expects to be a so- 
cially profitable activity, George emits that re- 
sponse to Ted. At this point our program cycles, 
and the activity George has emitted becomes 
the activity Ted has received. Now Ted must 
process information in order to select an ap- 
propriate and profitable response to George. 

Proposition 5, Distributive justice, the sub- 
ject of Homans’ fifth proposition, is perhaps 
the most complex of the concepts involved in 
the explanation of elementary social behavior. 
At the very least it requires consideration of 
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information at another level — that of social 
norms or accepted expectations for behavior 
ivithin a group. Through repetition of interac- 
tion situations within a group, certain behavior 
patterns become stabilized so that expectations 
develop regarding what constitutes justice in 
the distribution of rewards and costs between 
persons : The greater a man's costs in a given 
interaction, the greater his rewards ought to be 
But the implications of distributive justice go 
even further, taking into account a person's 
investments in an interaction — for example, his 
seniority, skill, experience, age, and sex. The 
greater the investment a person makes in an 
interaction, the greater the net profit he has a 
right to expect. Thus, according to the principle 
of distributive justice it is consensually ex- 
pected that certain antecedent costs and invest- 
ments should have as consequents certain types 
and degrees of reinforcement. Homans states 
the related proposition as follows : 

The more to a man's disadvantage the rule 
of distributive justice fails of realization, 
the more likely he is to display the emo- 
tional behavior we call anger.® 

More is included, however, for if a man re- 
ceives rewards beyond those to which he con- 
siders himself entitled, he is likely to experi- 
ence guilt feelings. 

Translating this proposition into computer 
routines posed some of the most interesting 
problems we have yet encountered in working 
with HOMUNCULUS. In effect, the list struc- 
tures of our agents had to be programmed to 
have consciences, and they had to include a 
repertoire of appropriate anger responses. 

In essence, our programmed interpretation of 
this proposition asks whether a stimulus activ- 
ity is appropriate in the given circumstances 
(P5, Box I, Figure 1). If so, then the person 
receiving it can process it as George did Ted's 
request, which he considered appropriate. If, 
however, the stimulus activity is judged inap- 
propriate, then more complex behavior results. 
To illustrate this, let us shift to a description 
cf the interactions between George and Tom, 
another worker in the same agency. 

It is an accepted office norm that a worker 
who asks for help should do so openly in a man- 


ner acknowledging the superiority of his con- 
sultant with respect to the given problem. 
Tom, however, has been seeking aid from 
George in a rather devious manner, coming to 
George with “an interesting problem" and say- 
ing he would like to see whether George arrives 
at the same solution as he. This has occurred 
three times in the recent past, and on each 
occasion, Tom has greeted George's suggested 
solution with the comment, “Yes, you reached 
the same conclusions I did." George decides 
Tom is violating the norms of fair exchange by 
evading the cost of thanking him for his assist- 
ance and conceding his superiority. The fourth 
time Tom presents him with an interesting 
problem George angrily responds, “Look, why 
don't you do your own work !" 

This description, of course, does not answer 
the question of how the computer is pro- 
grammed to behave in such an all-too-human 
way. George is programmed to treat time spent 
solving a problem presented by another worker 
as being help to that person for which recogni- 
tion and social approval are due. When his 
colleague responds to his efforts with an unre- 
warding confirmation that he arrived at the 
same conclusion, George finds this input inap- 
propriate in terms of his expectations regard- 
ing distributive justice. Therefore, routines 
processing Proposition 5 change George's image 
list of Tom so that next time he expects greater 
recognition and thanks than normal to atone 
for the present evasion. After three repetitions 
of this interaction sequence the discrepancy be- 
tween Tom's behavior and George's expectations 
will be so great that when George evaluates 
Tom's response he will plant a signal in his 
image list of Tom to indicate that interacting 
with him is not rewarding because Tom violates 
group norms. 

The next time Tom asks for an opinion after 
this warning signal has been set, George will 
respond by displaying anger or by storing up 
aggression to be expressed against someone 
else. In the computer program an anger re- 
sponse involves emitting behavior punitive to 
another person. But before actively punishing 
Tom, George will first assess the consequences 
to himself of such behavior. In one possible 
interaction sequence, if George finds that Tom 
is in favor with George's own boss, he may sup- 
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press his aggression at the moment and then 
release it the next time he interacts with a 
subordinate. 

The routines processing the negative branch 
of Propostion 5 (Box II, Figure 4) thus not 
only modify image lists but also use some of the 
routines from the other propositions to evalu- 
ate the probable consequences of direct anger 
responses. Depending on the outcome of this 
processing, the program either proceeds to 
Proposition 1 or the interaction is terminated. 

Once HOMUNCULUS was operating effec- 
tively — so that it simulated behavior character- 
istic of the civil service office discussed earlier — 
we faced the problem of how the model could 
be used to provide a more general test of Ho- 
mans’ theory and also to simulate across a 
broader range of experiments involving both 
individual decision-making and group interac- 
tion. Actually, the model as first completed 
could have been used; however, the task of 
building a complete repertoire of activities ap- 
peared overwhelming, especially when we con- 
fronted the need to provide to each group par- 
ticipant with a complete history of activities 
received from and emitted to each of the others 
plus the values and costs of each activity in 
each situation. 

In our new program of activities we have 
adopted a scheme of twelve categories which 
appear to encompass all problem-solving activi- 
ties.^ These categories include six dimensions 
of group interaction: 1) communication prob- 
lems where an individual either gives orienta- 
tion (information, repetition, clarification, and 
confirmation) or asks for orientation; 2) eval- 
uation problems where an individual either 
offers an opinion (evaluation, analysis, and 
expression of feeling) or asks for an opinion; 
3) control problems where an individual either 
presents a suggestion (direction and possible 
ways of action) or asks for a suggestion; 4) 
decision problems, where an individual either 
agrees (shows passive acceptance, understands, 
concurs, and complies) or disagrees; 5) tension 
reduction problems where an individual either 
releases tension (jokes, laughs, and is satisfied) 
or displays tension; and 6) reintegration prob- 
lems where an individual either shows soli- 


darity (by raising the other’s status and by 
giving help and reward) or shows antagonism. 

On the basis of laboratory studies of small 
groups we know the approximate frequency of 
each type of activity, the changes in relative 
frequency of each type of activity through time 
as a group interacts, and the types of responses 
likely to occur to each type of activity. This 
information is programmed into reference 
groups, and data such as the appropriateness 
of a given response to an activity differ for 
various reference groups — e.g., for manage- 
ment, as contrasted to a labor union. When the 
individual participant does not have in his own 
repertoire sufficient information to select a re- 
sponse to a stimulus, he checks with his refer- 
ence group to find one to consider. 

As the program is now written we no longer 
specify the first actor and action. Currently we 
are running interactions among triads. When 
the program begins operating, control is turned 
over to the master executive routine. This 
checks to determine whether a member of the 
triad needs to act. If so, control is turned over 
to the actor. If not, the need for interaction of 
each member is incremented by a figure deter- 
mined by his basic need for social interaction 
plus his energy level. The program cycles until 
some member passes the threshold level, and 
then he assumes control. 

If the initiator knows one or both of the other 
members of the triad he selects the one with 
whom to open inte^-action on the basis of expec- 
tations for securing a rewarding response. If 
the group is composed of unacquainted mem- 
bers, the initiator evaluates the other two and 
selects the one whose characteristics lead to a 
prediction of rewarding interaction. This in- 
volves assessing the “personas” or public images 
of the other members and comparing their 
characteristics with his own self-image to deter- 
mine compatibility. 

If the initiator knows the person he has se- 
lected, he usually chooses an activity to emit on 
the basis of past experience. Otherwise, he con- 
sults his primary reference group to find an 
activity likely to lead to positive reinforcement 
when emitted to a person like the one chosen — 
that is, to an individual who is, say, more skilled 
in the task than he, less skilled in social interac- 
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tion, one level above him in social background, 
and an adherent of similar reference groups. 

Interaction thus begins and proceeds accord- 
ing to the flow diagram discussed earlier. The 
propositions guiding behavior remain the same; 
however, because of refinements introduced — 
for example, in the routines enabling one mem- 
ber to interrupt another before he has oppor- 
tunity to emit a chosen activity — ^the process 
has become more complex. A programmed 
member may keep in available memory at least 
two sorts of activities he may wish to emit: a 
normally selected response which was pre- 
vented by someone else’s interruption, or an 
anger response which the individual suppressed 
because of the given circumstances. The need 
to emit such thwarted responses fades as other 
activities intervene, so that after five activities 
have been emitted by others, the individual 
‘‘forgets'’ having been interrupted and goes on 
to select other activities. However, the mere 
fact of being interrupted raises his need to 
participate, and increases the likeihood of his 
interrupting in turn to carry out his previously 
selected activity or to shift his attention to 
other activities. 

Discontinuity and annoyance responses are 
likely to arise when activities are focused on 
social-emotional areas and when a series of 
interruptions occur. When the focus is on 
task areas, interruptions may lead to annoy- 
ance responses, but the discontinuity usually 
does not appear because the activities continue 
within the context defined by the given task. 
Memory of anger responses does not deteriorate 
so rapidly, but even this disappears after a 
series of twenty activities. 

We mentioned that at present we are running 
experiments dealing with three-person groups 
and discussed the processes through which in- 
teraction within the group is initiated. Some of 
the findings in this area have proved to be of 
theoretical interest. For instance, a group of 
three unacquainted persons usually breaks down 
into a pair and an isolate. On the other hand, a 
group of three already acquainted persons re- 
mains a group of three if all the pair relation- 
ships are positive. If the pair relationships are 
not all positive, the outcome is more complex 
and may range from a relatively unstable group 


of three to three isolates who never become a 
group. 

Outcomes from such interactions using 
HOMUNCULUS help explain what happens. 
Sooner or later a niember of the unacquainted 
group initiates interaction with one of the 
others. Each time this pair's interactions prove 
rewarding, information is stored in their image 
lists which lessens the cost of finding rewarding 
acts to emit to each other and increases the ex- 
pectation of a rewarding response. Thus the 
rate of interaction between them increases, the 
repertoire of rewarding activities increases, and 
the likelihood of the bond’s being disrupted 
lessens. The third man does not have the oppor- 
tunity to let the other two know how rewarding 
he might be and remains an enigma. Thus the 
interaction leads to a dyad and an isolate. 

But if the first interaction is punishing to 
one or both of the two involved, the probability 
becomes higher that interaction will be initiated 
with the third person. Assuming that his values 
and norms make rewarding interaction likely, 
then whichever one captures him first will, in 
most instances, form a dyad with him and leave 
the slower moving man as isolate. If no re- 
warding pair is found, then in most circum- 
stances a group never forms. 

With a group of three persons who already 
know and like each other, the same principles 
lead to different outcomes. Each has already 
established a repertoire of activities that he 
knows he can emit to the other two and that will 
be rewarded. Therefore, when there is slight 
satiation with the rewards one of them can 
provide, attention is temporarily transferred 
to the other. Occasionally, also, the interrupt 
mechanism leads the one not yet active in the 
group to join the conversation. Since the others 
have experienced rewarding interactions with 
him, he is not rebuffed. 

In reaching beyond the simulation of small- 
group laboratory experiments, we have begun 
planning experiments to use HOMUNCULUS 
to test theories of organization design. None of 
these will be programmed and on the machine 
for at least a year, and probably longer. This 
appears to us to be a particularly promising 
area in which to blend contributions to socio- 
logical theory and to organizational practices. 
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As our first study in this area we plan to ex- 
amine the relative effectiveness of the tradi- 
tional authoritarian hierarchy vs. the multiple- 
overlapping-group design advocated by Rensis 
Likert. 

In the traditional structure, communication 
occurs between adjacent men up and down the 
line, with information transmitted up, decisions 
made at the top, and instructions flowing down 
the line. Theoretically, lateral communication 
does not occur. In the overlapping-group or- 
ganization, each member within a group is in 
regular communication with every other mem- 
ber ; decisions are reached by the group rather 
than the superordinate ; and the members are 
responsible for passing on the information to 
other groups of which they are members. 

Computer simulation of decision-making ac- 
tivities by both types of organization is expected 
to produce information showing the relative 
efficiency of the two designs. Such factors are 
considered as the amount of information avail- 
able to those making decisions ; the effect of the 
decision-making process on the motivation of 
the workers to perform ; and the impact of each 
type structure on the satisfaction of individual 
workers, on their sentiments toward each other, 
and on their sentiments toward the organiza- 
tion. 

CONCLUSION 

In our discussion of HOMUNCULUS as an 
application of computer simulation to problems 
of sociological or social-psychological theory we 
have tried to demonstrate that research involv- 
ing computer models not only plays the passive 
role of testing and verifying theory ; it performs 
active functions for the development of theory. 
Formulating a theory as a computer model 
affords one a relatively tractable representation 
and possibly a more meaningful conceptualiza- 
tion, having increased precision as a result of 
the clarification of concepts the programming 
process necessitates. This organizational func- 
tion of the model is further enhanced by the 
computer’s capacity actually to set the theoreti- 
cal processes in motion and output data gener- 


ated as logical consequences of hypothesized 
processes. While investigating the extended 
logical consequences of a theoretical system is 
in itself a rewarding activity for a social theo- 
rist, another benefit involves the possibility of 
generating unanticipated findings leading to 
refinement of theoretical constructs or elabora- 
tion of a theory and to experimentation directed 
to the verification of new hypotheses. This pos- 
sible serendipity bonus in addition to the in- 
creased precision, tractability, and dynamic 
capacity of a computer model gives the social 
theorist ample reward for the cost of transla- 
tion from verbal formulation. 

REFERENCES 

1. Bales, Robert F. Interaction Process 
Analysis: A Method for the Study of Small 
Groups. Cambridge, Mass.: Addison-Wes- 
ley Co., Inc., 1950. 

2. Blau, Peter M. The Dynamics of Bu- 
reaucracy, Chicago ; University of Chicago 
Press, 1955. 

3. Gullahorn, John T., and Gullahorn, 
Jeanne E. 'The Computer as a Tool for 
Theory Development.’’ Santa Monica, Cali- 
fornia : System Development Corporation, 
SP-817, June 5, 1962. To appear in Dell 
Hymes, editor. Uses of Computers in An- 
thropology. The Hague, Netherlands: 
Mouton and Company, in press. 

4 . , Computer Model of Elemen- 

tary Social Behavior,” in Edward Feigen- 
baum and Julian Feldman, editors. Com- 
puters and Thought. New York:McGraw- 
Hill, 1963. Also reprinted in Behavioral 
Science, October, 1963, 8, No. 4, pp. 354- 
362. 

5. Homans, George C. Social Behavior: Its 
Elementary Forms. New York: Harcourt, 
Brace & World, 1961, pp. 7, 53, 54, 55, 75. 

6. Likert, Rensis. New Patterns of Manage- 

ment. New York: McGraw-Hill, 1961. 

7. Newell, Allen (Ed.). Information Proc- 

essing Language — V Manual, Englewood 
Cliffs, New Jersey: Prentice-Hall, 1961. 




REAL-TIME COMPUTER STUDIES 
OF BARGAINING BEHAVIOR: 

THE EFFECTS OF THREAT UPON BARGAINING* 

Robert J. Meeker, Gerald H. Shure, and William H. Moore, Jr. 

System Development Corporation 
Santa Monica, California 


INTRODUCTION 

Most behavioral scientists are well aware of 
computer technology as a means of reducing 
and otherwise analyzing empirically derived 
data. A small number have also discovered, and 
enthusiastically endorsed, the computer’s poten- 
tial for simulation of analytic models. But little 
has been made of yet another application of 
com.puter technology in the behavioral sciences, 
namely, the use of computers for data acquisi- 
tion. 

To anyone familiar with the development of 
real-time data processing, the prospect of a 
computer-administered experiment suggests a 
general increase in efficiency and flexibility over 
the manual mode of experimentation. One tends 
to think immediately of greater speed, greater 
accuracy, and, in some instances, greater econ- 
omy for a research program ; but we are empha- 
sizing the fact that certain types of behavioral 
studies can be greatly broadened and enhanced 
by exploiting the computer’s efficiency and 
ability to gather the desired data. In this paper 
we shall develop the thesis that computer- 
administered experimentation permits us to 
gather an important type of behavioral data 
which are now practically unobtainable ; as evi- 
dence we shall cite our computer-based empiri- 
cal research on bargaining and negotiation be- 
havior. 


Description of the Project 

Our research is designed to study bargaining 
and negotiation behavior, particularly the dy- 
namic interaction process that takes place in 
mixed-motive (non-zero sum) bargaining situa- 
tions. This study utilizes the computer for 
real-time experimental control and asaessment 
of subjects at critical points during bargaining 
situations. The project goal was to develop a 
general computer-based experimental vehicle 
which would provide unique opportunities to 
study bargaining and negotiation behavior and 
relate it to social-psychological factors (e.g., 
threat, trust, cooperation, status, power), per- 
sonality variables, and game-strategic charac- 
teristics and tactical moves. 

General Problem Background 

Bargaining and negotiation behavior is a 
striking example of an area of study which can 
be broadened by having access to data that have 
heretofore been elusive. Investigations in this 
area have typically been confined to data on the 
overt behavior of the bargainers (i.e., the bids, 
moves, outcomes, rates of response, and so on) ; 
yet investigators, theoreticians, and bargainers 
themselves recognize that overt behavior is di- 
rectly related to subjective attitudes, intentions, 
expectations, and perceptions. Clearly, in the 
temporal sequence of bargaining exchanges. 


* This research was supported by the Advanced Research Projects Agency, under contract SD-97. 


115 



116 


PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 


participants shift their evaluation of outcomes, 
alter their goals, and attempt to induce their 
opponents to alter theirs. By these criteria, a 
complete analysis of bargaining requires not 
only data on the overt pattern and sequence 
of verbal exchanges, moves and countermoves, 
but also a parallel assessment of the partici- 
pants' changing intentions, expectations, and 
perceptions of relative status and opponent be- 
havior. 

Consequently, we must ask the bargainer 
what he is doing, or thinks he is doing, what he 
plans to do, and why. The importance of these 
kinds of ‘‘subjective" data is, regrettably, 
matched by the difficulties of collecting and 
interpreting them. To obtain useful answers 
from a subject, questions must be pertinent to 
what has been going on, unambiguously stated, 
and not be unduly suggestive. Unfortunately, 
it is not yet economically feasible to put each 
experimental subject on a psychoanalyst's 
couch. But even if it were feasible, the highly 
personalized interrogation creates additional 
problems of its own — those of comparability of 
questions and answers and the effect of the 
presence of the interrogator on the responses 
made by the subject. Small wonder that psy- 
chologists have developed strong objections to 
methods for obtaining subjective data during 
the ongoing experiment. These objections, eco- 
nomic and methodological, have virtually ex- 
cluded such data even from studies for which 
they are most pertinent. 

Another important aspect of bargaining and 
negotiation is its dynamic nature. Two free- 
acting agents may effect an endless variety of 
interresponse patterns. Such dynamic inter- 
action phenomena are important for the under- 
standing of behavior in a variety of social- 
cognitive situations ; yet the traditional methods 
for recording and analysis of social interaction 
data are extremely costly and place exorbitant, 
time demands on highly trained professional 
personnel. 

In our studies, the computer is used to over- 
come these difficulties by mediating and moni- 
toring the communications between experimen- 
tal subjects. Since all subject interactions are 
conducted via the computer, it is easier to 
oversee, record, analyze, and even control the 


interaction process. Additionally, the computer 
capacity for rapid recording and analysis is 
used for constant monitoring of subject be- 
havior ; and this, in turn, becomes the basis for 
selective inter_rogation, by the computer, of sub- 
jects on their individual intentions, perceptions, 
motives, and expectations. Thus, the computer 
serves the unique function of a standardized in- 
terviewer which carefully, but unobtrusively, 
monitors all participating subjects, detects all 
situations about which further information is 
desired, and then selectively asks questions 
which are relevant to what has just happened 
to the subject. The impersonal atmosphere 
eliminates in some measure the problems asso- 
ciated with the presence of the examiner and 
would appear to encourage freer responses. 

Definition of the Bargaining Sitiuition 
and the Possible Effects of Threat 

Three essential characteristics of a bargain- 
ing situation may be identified: (1) For both 
parties there are a number of possible agree- 
ments and each party would be better off, or no 
worse off, in reaching an agreement than in 
failing to do so. In game theory terms this 
means that bargaining is involved with varia- 
ble-sum games, not constant-sum games. That 
is, the sum of payoffs is not fixed — more for one 
player does not necessarily mean less for the 
other. (2) For each party there is a range (at 
least two) of such potential agreements. (3) 
Each party has preferences or interests with 
regard to the possible agreements that are gen- 
erally opposite to those held by the other party ; 
at the very least both players cannot be maxi- 
mally satisfied at the same time. 

Bargaining is thus a situation in which the 
participants have mixed motives towards one 
another. It is neither purely coordinative or 
cooperative, nor purely competitive. That is, 
the ability of one participant to gain his ends 
is dependent to an important degree on the 
choice of decisions that the other participant 
will make. 

The bargaining processing itself may be 
either explicit or tacit. Typically, when we 
think of bargaining, we think of the explicit 
mode of direct verbal exchange — the dialogue 
of bids, counterbids, concessions and compro- 
mise until an agreement is reached. On the 
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other hand, one can bargain in situations where 
there is little or no opportunity (or no desire) 
for direct verbal exchange. When drivers might 
meet at an intersection, or competitors engage 
in a price war, their dialogue may be limited to 
the actions that they take. Our experimental 
situation focuses on this tacit form of bar- 
gaining. 

Bargaining situations may also involve 
threats — a. means of directly signalling and/or 
im.posing loss or damage on the other bargainer. 
Such means might entail mutual loss as in a 
strike or an extortion attempt. Speculation on 
the effects <of threat on bargaining runs in two 
directions. One position tends to underscore the 
deleterious effects of threat. The threat spiral- 
ing theory predicts a pattern of escalation 
which leads to disrupting hostility, claiming 
that perceived threats (whether intended as 
such or not) will beget counterthreats. In this 
framework, the mere possession of a threat is 
sufficient to initiate the escalation process. 

In contrast, one finds among other writers an 
almost sanguine acceptance of coercive and 
threatening techniques as bargaining tools. In 
this view, since every bargaining situation in- 
volves some minimal aspects of threat — at the 
very least a bargainer can refuse to reach any 
agreement — threats present themselves as nat- 
ural and necessary means of bargaining; means, 
which can and will be used to effectively define 
and convey importance of an outcome, degree 
of resolve, or various other facets of the bar- 
gaining relationship. 

One finds then what appears to be, on the 
surface at least, inconsistent expectations re- 
garding the employment of threat. To the advo- 
cate of its use in bargaining situations, coercion 
is specifically designed to influence the “rational 
decisions'’ of the other party by controlling his 
expectations of outcomes which will follow from 
his actions. Contrasted to this, we find descrip- 
tions of threat-evoked behavior which is any- 
thing but rational and which leads to conse- 
quences desired by none. Since neither party 
to this argument has clearly specified the condi- 
tions which bound the range of his generaliza- 
tions, the extent of the disagreement is not 
apparent. Since both views cannot simultane- 
ously hold for the same situation, it becomes 


important to determine empirically the bar- 
gaining conditions under which threat will have 
either of these effects. 

Laboratory Studies on the Effects of Threat 

In our investigations we have focused on the 
effects of threat on interpersonal bargaining. 
In this regard we were particularly interested 
in the widely cited findings of a series of experi- 
ments by Deutsch and Krauss.^’ ^ 

In their experiments a bargaining game was 
played under three conditions: (1) In the bi- 
lateral threat condition, both players could take 
an action which directly imposed an obstacle to 
the other's making money. (2) In a unilateral 
threat condition, only one of the players had the 
means of imposing the obstacle. (3) In a third 
condition, neither individual had a direct threat 
capability. 

The following results were obtained : Where 
neither subject possessed a threat capability, 
the profits increased over trials and were posi- 
tive for both players. Where one of the players 
possessed the threat capability, the profits were 
negative in the early trials, but improved and 
became positive for both players. In the bi- 
lateral threat condition, there was no improve- 
ment over trials and earnings remained nega- 
tive throughout — that is, both players lost 
money instead of making money. 

From the Yesults of their bargaining game, 
Deutsch and Krauss derive a three-step process 
occurring in H:hreat-counterthreat spiraling: 
(1) If there is a conflict of interests, and threats 
are available, they are likely to be used. (2) 
The threatened person, if he feels superior or 
equal in status to the threatener, will feel hos- 
tile and will use counterthreat or show increased 
resistance to yielding. (3) As a result, further 
intensification of the competitive interest of the 
bargainers occurs and reduces the likelihood of 
their arriving at a cooperative agreement. 

We should note that this hypothesis depends 
on references to subjective attitudes, expecta- 
tions, intentions and perceptions. Reviewing 
their experiment, we speculated about other 
hypotheses — equally dependent on intentions 
and perceptions — which might also account for 
these results. Our reservations would not have 
been satisfied by a simple straightforward repli- 
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cation of their experiments, for even supposing 
an exact replication of results down to the level 
of moves and counter-moves, we should still 
want to know whether the attendant pattern of 
intentions and perceptions support their hy- 
potheses regarding the perception of threat and 
the escalation of hostility. Indeed, this is pre- 
cisely the sort of question for which we can 
acquire direct data through the use of com- 
puter-administered experimentation. 

To illustrate how these capabilities can be 
implemented and, used, we turn to a considera- 
tion of our experimental vehicle and the details 
of our investigations on the effects of threats in 
bargaining behavior. 

General Experimental Gaming Vehicle 

Experiments were carried out in the Systems 
Simulation Research Laboratory of the System 
Development Corporation. The equipment con- 
figuration included the Philco 2000 computer, 
computer-tied 8-by-l 1-inch television consoles, 
and associated switch insertion response con- 
soles. Computer programs pair as many as 24 
subjects who then play against each other. Sub- 
jects can be considered ''on line” with the com- 
puter. They send messages (moves, bids, 
threats, offers) to their paired opponent via 
switch insertions which are displayed on the 
receiver's TV console. Computer programs pre- 
sent the game situations to the subjects, assist 
in umpiring of legal moves, provide displays of 
game-relevant information, record all moves, 
messages and times of response, and probe the 
subject's bases for his actions. 

Procedtires of Experimentation 

Our experimental situation incorporated the 
formal features of the Deutsch and Krauss ex- 
periment, although the simulated situation was 
different. In our Communication Gama, sub- 
jects were told that they were going to perform 
as operators in a communication system. In 
each trial, the subject's task was to transmit 
messages from a set of one six-letter and two 
four-letter words. Each subject was informed 
that he was assigned to a communication chan- 
nel which would also be used by one other 
subject. This channel, common to a pair of 
subjects, had a total storage capacity of six 
units. The six-unit channel storage limitation 
prevented concurrent transmission of messages 
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Figure 1. Communications Game. 


by both subjects. Figure 1 illustrates the game 
moves and relationship between the bargainers. 

An individual subject could earn 25 cents for 
transmitting a six-unit message, ten cents for 
each four-unit message, or nothing if he failed 
to transmit any complete message. Transmis- 
sion rules resricted an individual operator to 
earning a maximum of 25 cents and a pair of 
subjects to joint earnings of 35 cents per work 
period. 

Work periods were made up of 15 "joint- 
action” turns during which both subjects would 
take their actions. In each turn, a subject chose 
from one of three action alternatives : inserting 
a letter unit into channel storage, withdrawing 
a letter unit from channel storage, or passing 
(taking no action). A letter unit could be in- 
serted successfully only if there were sufficient 
space in storage (i.e., the six-unit storage ca- 
pacity could not be exceeded). If attempted 
insertion (s), together with the units previously 
in storage, exceeded the channel capacity of six 
units, then an “overload” resulted, and this indi- 
cation was displayed as the reason that the 
attempted insertion (s) had not entered chan- 
nel storage. In effect, subjects had to work out 
an arrangement for sharing the channel. 

In the “threat” conditions, both subjects were 
provided with an additional “block” action 
which prevented the other operator's attempted 
insertions from entering channel storage. The 
effect of the block action was enduring (it did 
not require reinstatement to remain in effect). 
Both subjects could independently take block 
actions on any turn and maintain them for as 
long as they wished, or remove their own blocks, 
but each subject could initiate the block only 
once in any given work period. 

Subjects received instructions in groups rang- 
ing in size from 18 to 24 members, were as- 
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signed to individual cubicles, and did not know 
with whom they had been paired. Each subject 
had a switch insertion box for taking actions 
and a television receiver for the display of chan- 
nel status and the results of their attempted 
insertions. Subjects had no other means of 
direct communication with one another. 

All subjects were explicitly told to make as 
much money as possible regardless of how much 
the other operator made. After each trial they 
were informed what they each had transmitted. 
There was real money at stake in all but one 
condition. 

The computer program permitted each pair 
to proceed at its own rate of response. Follow- 
ing each work period, individualized computer- 
designated questions were displayed to the sub- 
jects to which they responded by means of their 
switch insertion boxes. The questions presented 
were contingent upon moves taken and the re- 
sults of the immediately preceding work period, 
and focused upon perceived significance of these 
events. 

Seven experimental conditions were studied. 
Six of these were variations of the “threat” 
condition involving bilateral availability of the 
block. In all of the threat conditions, the action, 
if maintained, had the direct effect of prevent- 
ing the other bargainer from making any 
money. The seventh was a “no threat” condi- 
tion in which neither operator possessed a block 
action with which to control the other opera- 
tor’s behavior. Since the block conditions did 
not differ significantly from one another, they 
have been combined for the present discussion 
and compared with the results obtained for the 
“no threat” condition. 

Results and Discussion 

Having mirrored the salient features of the 
Deutseh and Krauss game, we may determine 
whether any of the steps described by Deutseh 
and Krauss are found in our data. Let us look 
first at the six conditions where threat action 
i(the block) was available. 

If threat is available, will it be used? The, 
answer is a resounding “yes.” Ninety-three per 
cent of the 19Jt subjects use it on one or more 
occasions during the 20 trials. Furthermore, 
there is little hesitation in employing it. The 



FIRST TnAL USED 

Figure 2. Trial of First Block Action. 

median trial of initial use is the first trial. (See 
Figure 2.) 

Will the threat evoke counter threat? If one 
subject employs the block, does it evoke recipro- 
cation? Again the answer would appear to be 
clearly affirmative. Of the 95 pairs where the 
block was employed, only 11 subjects never em- 
ployed the block in return. Furthermore, the 
median latency of counter response is short, 
with two-thirds of the subjects responding on 
the same or immediately following trial. (See 
Figure 3.) 

Continuing to look exclusively at those ex- 
perimental conditions in which the block action 
is available, we compared the bargaining out- 
come for those pairs where both members use 
the threat with those in which one or both vol- 
untarily restrain from using it for the full 20 
trials (see Figure 4). A significantly lower 
mean joint payoff was earned by those pairs 
who use the threat bilaterally than by those 
pairs where one or both subjects avoid the use 
of threat. 

These findings were consistent with expecta- 
tions derived from the threat-counterthreat es- 



TRM.S 

Figure 3. Distribution of Trial Latency of 
Counter- Blocking. 




120 


PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 


PAIR 

BLOCK USAGE 

MEAN JOINT 
PAYOFF 

N 

BILATERAL 

$5.47 

114 

UNILATERAL 

AND NEITHER 

$6.73 

13 

Mdiff 

$1.26 



Figure 4. Block Use Related to Joint Payoff for 
Six Block Conditions. 

calation pattern. On the other hand, other find- 
ings are clearly at variance with the threat- 
counterthreat spiraling hypothesis. 

1. A comparison of the block and no-block 
conditions showed the reverse of the predicted 
threat spiraling. The pairs who had the block 
action available had a significantly higher joint 
payoff than did those without it. (See Fig- 
ure 5.) This result is a direct reversal of the 
Deutsch and Krauss findings. 

2. Again focusing only on the block condi- 
tions and those pairs who had not yet reached 
a cooperative agreement in the first five trials 
— that is, those pairs who were having some 
difficulty with each other — we compared those 
pairs who made more use of the block action 
during these first five trials with those who 
made less. In a complete reversal from expec- 
tations, we found that significantly more of the 
pairs who made extensive early use of block 
action subsequently reached a cooperative 
agreement. (See Figure 6.) Nor was there 
evidence that greater restraint, expressed in 
delaying the counterthreat response, tended to 
dampen the escalation of hostility or produce 
more favorable outcomes over those where less 
restraint was shown in taking a counterthreat 
action. We found no significant relationship 
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Figure 5. Mean Joint Payoff for No Block and 
Block Experimental Conditions. 


BLOCK USE 
FIRST 5 TRIALS 

COOPERATIVE AGREEMENT ^ 

AFTER 5 TRIALS 

NEVER 

HIGH 

18 

12 

LOW 

5 

15 


N = 50 
x2 = 5.92 

{p<.02) 

Figure 6. Frequency of Block Use (First Five Trials) 
Related to Subsequent Cooperative Agreement. 

between counterthreat response latency, meas- 
ured in number of trials, and joint outcome for 
each pair. It thus appears that greater restraint 
in early use or delay in counterthreat response 
does not dampen the escalation of hostility. 

These results indicated that there is no sim- 
ple relationship between the use of threats (as 
measured by blocking) and escalation of hos- 
tility (as assessed by difficulty in achieving a 
cooperative agreement) . 

Up to this point our data show the following : 
Subjects who have the block do better than 
those who do not; yet, those who have the block 
but do not use it do better than those who use it. 
This puzzlement is compounded by considering 
the additional fact just noted. Those who use 
the block more initially and with less restraint 
do significantly better than those who use it 
less or use it more cautiously. 

At this point our methodology of computer 
probes may permit us to do something which 
was not possible in previous studies. Since the 
computer has inquired of each subject into his 
intention and perceptions associated with each 
use of the block, we may turn to this informa- 
tion instead of engaging in further speculations 
about the meaning of our results. 

3. Generally speaking, how were the blocks 
intended and perceived ? The first striking find- 
ing is the frequency with which first blocks are 
used with a cooperative intent — specifically as 
a signal for coordinative purposes. Approxi- 
mately 70 per cent of all users included a co- 
operative reason among the three most impor- 
tant reasons for their initial block usage. In 
contrast, only 20 per cent failed to register any 






REAL-TIME COMPUTER STUDIES OF BARGAINING BEHAVIOR: 


121 


cooperative intent, indicating that their use of 
the block was exclusively intended to threaten 
or express some reactive hostility. 

That these self-ratings did not merely reflect 
a tendency to characterize one’s own intentions 
in positive terms is indicated by the fact that 
users with initially cooperative intent do sig- 
nificantly and markedly better in terms of sub- 
sequent cooperative agreement than do those 
who indicate no cooperative intent in the use of 
the block. (See Figure 7.) Thus, despite the 
fact that subjects were explicitly given an “indi- 
vidualistic” orientation and sizable sums of real 
money were at stake, a strongly cooperative 
orientation was carried into the game by most 
subjects. These findings suggest that many of 
the subjects, at least to some degree, substitute 
cooperative incentives in place of the individ- 
ualistic ones which the experimenter attempted 
to induce with instructions and monetary re- 
wards. 

Knowing the cooperative use our subjects 
made of the block action, it is now less surpris- 
ing that a comparison of the “block” and “no 
block” conditions showed the reverse of the 
Deutsch and Krauss results. It is clear, then, 
that the block action need not always have nega- 
tive meaning, and it would appear that it can 
be an actual aid to bargainers if they have the 
disposition to use it to achieve an optimal co- 
ordinate agreement (even, as in the present 
case, where pre-game ratings showed that our 
subjects predominantly viewed the block action 
as a hostile act) . 

4. Since so many of our subjects achieved an 
early and apparently easy cooperative agree- 
ment — they were willing to share the payoff 
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Figure 7. Intention of First Block Use (Within Pair) 
Related to Median Trial on Which a Cooperative Agree- 
ment Was Achieved. 


without conflictual confrontation and needed 
only to coordinate, not to bargain — we next con- 
sidered just those pairs of bargainers who do 
not show outward evidence of mutual coopera- 
tion (more specifically, those who had not 
reached a cooperative agreement by the fifth 
trial) to see what effects threat might have in 
this fuller bargaining context. These cases 
should be more interesting and germane for the 
Deutsch and Krauss hypotheses, for, as a class, 
these are the bargainers who show evidence of 
initial interference — whether it be for reason 
of incoordination or simply conflict of interest 
in doing so. Among these pairs, one would sup- 
pose that those who subsequently reached a co- 
operative agreement would exhibit an initially 
lower level of hostility than those who never 
achieved agreement. 

We have already seen that if the frequency 
of block usage is employed as an objective index 
of hostility, the findings run counter to this ex- 
pectation ; but since we now know that hostility 
cannot always be ascribed to block usage, let us 
look directly at the subjective indices of hos- 
tility for the same groups. We find at this level 
also there is no evidence for the hypothesis. 
Pairs which subsequently achieved a coopera- 
tive relationship show a higher instance of re- 
ported hostile intentions and perceptions over 
the first five trials than those who failed to 
achieve such a relationship. (See Figure 8.) 
While not statistically significant, the trend is 
clearly counter to the spiraling hypothesis. 

Finally, let’s consider one further possibility. 
From the threat-counterthreat hypothesis, one 
should expect escalation in subjective hostility 
among those pairs who subsequently failed to 
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Figure 8. Early Subjective Hostility Related to 
Subsequent Cooperative Agreement. 
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reach a cooperative agreement and evidence of 
de-escalation among those who achieved agree- 
ment. In fact, increase or decrease in subjective 
ratings of hostility, both perceived and intended, 
over the first five trials show no consistent rela- 
tionship to success or failure in achieving a 
cooperative agreement (see Figure 9). 

All of our findings — the predominantly co- 
operative intent and perception of block use; 
the fact that availability of threat is as- 
sociated with better outcome; the fact that, 
for pairs that do poorly, neither objective nor 
subjective indices provide evidence for the 
escalation of hostility in the early phases of 
bargaining — all of these would suggest strin- 
gent qualifications for the underlying mech- 
anisms of the threat-counterthreat hypothesis 
if it were to be maintained. If our results run 
counter to the spiraling hypothesis, what ex- 
planation can we offer to account for the vary- 
ing degrees of difficulty which our subjects had 
in reaching a bargaining agreement? 

It is obvious that pairs of cooperatively ori- 
ented bargainers can achieve a jointly accept- 
able agreement which is profitable to both. It is 
also the case that competitive pairs (in which 
both members want to earn more than half of 
the available payoff) tend to do very poorly. 

What happens, however, to those dyads where 
one member is well-intentioned and conciliatory 
while the other is out to get as much as he can? 
A naive extrapolation of the threat spiraling 
hypothesis would suggest that these pairs should 
do better than the competitive pairs. We have 
collected pre-game statements of plans by sub- 


FREQUENCY OF 

HOSTILE RATINGS 
DURING FIRST FIVE TRIALS 

COOPERATIVE AGREEMENT 



INCREASING 

(TRIALS 4 + 5>TRIALS 1 + 2) 

15 

13 

DECREASING 

(TRIALS 4 + 5 < TRIALS 1 + 2) 

00 

14 


N = 50 

x2>.io 


Figure 9. Early Increase and Decrease in Level of 
Subjective Hostility Related to Subsequent Cooperative 
Agreement. 


jects which provide a basis for such a com- 
parison. We compared competitive dyads with 
mixed dyads (where a subject planning to share 
the maximum joint payoff 50/50 is paired with 
one planning to earn more than one-half the 
maximum joint payoff). While the competitive 
pairs were slower to adopt a coordinating strat- 
egy, all pairs but one succeeded in doing so 
within the 20 trials. In contrast, 34 per cent of 
the mixed pairs failed to do so. Indeed, they 
have the poorest outcomes of all pair combina- 
tions based on pre-game plans. Similar findings 
Were obtained for pair comparison based on 
personality classification. It is not the dyads 
which are composed of two players with domi- 
nating strategies that have most difficulty; 
rather, it is those dyads where one member is 
well-intentioned and conciliatory and the other 
is dominating or belligerent. (See Figure 10.) 

As a tentative explanation of these findings, 
we propose that the well-intentioned and con- 
ciliatory member is initially reluctant to “force” 
cooperation — to employ his block as threats — 
and is ambivalent and oscillating when he does 
so. The subject who delays in responding to a 
threat with a counterthreat displays to the ag- 
gressive member a weak intention to resist and 
encourages him to persist in his original de- 
mands. These then become increasingly unac- 
ceptable to the conciliatory member. Two addi- 
tional findings support this interpretation. 

Let’s look at the situation of the cooperatively 
inclined bargainer who initially fails to report 
seeing any cooperative intent in the other bar- 
gainer’s use of the block. Should he counter- 
block immediately or try to nurture coopera- 
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Figure 10. Mean Number of Joint Zero Profit Trials 
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tion otherwise? Our data show his chances to 
achieve a cooperative agreement are signifi- 
cantly better if he responds more immediately 
(within two trials) with a counterblock than if 
he delays longer in doing so. Twenty-two out 
of 37 ‘‘immediate'' counterblockers concur on a 
cooperative agreement, whereas only one out 
of seven “delayers" do so. 

Finally, our explanation may help to recon- 
cile our otherwise puzzling finding — that 
greater block usage in the earlier trials is asso- 
ciated with greater likelihood of agreement. On 
further analysis we find that, on the average, 
those who never reached agreement used the 
block less than once every other trial during this 
period. Thus they employed their block only in 
a sporadic or intermittent fashion. 

In summary, while bargainers who resist the 
use of threats show the most favorable joint 
outcomes, these results appear to be related 
primarily to the pairing of subjects with pre- 
game cooperative dispositions. Rather than 
spiraling hostilities, a posture of firmness and 
determination expressed in a consistent and 
rapid response to threat with counterthreat may 
contain a belligerent player. Lack of resolution, 
as expressed in delay and oscillations in employ- 
ment of counterthreat to an aggressor, may sig- 
nificantly increase the likelihood of escalation 
of conflict. 

In follow-on studies we plan to investigate 
the conditions under which the exchange of 
threat and counterthreat actions lead to conflict 
escalation and/or to a dampening or reversal 
of the conflict ; to identify “points of no return" 
if these are present ; to assess the effect of mas- 
sive retaliatory capability on the use of smaller 
threat actions ; to determine under what condi- 
tions such a capability would exert a stabilizing 
or destabilizing influence on negotiations. 


In pursuing these goals, we also plan to ex- 
ploit further the unique advantages afforded by 
our computer approach. We are planning im- 
provements in two major directions: first, to 
improve techniques for eliciting subjective data 
through computer probes so as to minimize their 
interrupting features and increase the value 
of the information elicited; second, to gain ex- 
perimental control through computer simula- 
tion of one of the players in each pair. This will 
permit direct investigation of effects of indi- 
vidual styles of play (ruthless competitor, stra- 
tegic cooperator, irresolute cooperator, moral 
pacifist, etc.) ; this, in turn, will provide greater 
control of intermediate critical stages of the 
game, thus reducing the obscuring effects of 
unique patterns of play and improving ability 
to evaluate and compare results for different 
bargaining pairs. 

More important even than the specific find- 
ings in this study is the demonstration of the 
potential usefulness of the computer as a tool 
for gathering critical data in studies of social 
interaction. As a result, data become available 
which have been virtually unobtainable on a 
scale necessary for scientific investigation. 
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REAL TIME QUICK-LOOK ANALYSIS 
FOR THE OGO SATELLITES 

R, J. Coyle and J. K. Stewart 
Datatrol Corporation 
Silver Spring, Maryland 


Introduction 

The National Aeronautics and Space Admin- 
istration has designed a series of general pur- 
pose orbiting satellites which have the family 
name of Orbiting Geophysical Observatory 
(OGO). Each of these satellites is capable of 
carrying up to 50 scientific experiment devices, 
which transmit data to ground stations via a 
common telemetry channel. There are two 
tracking stations, at Rosman, North Carolina, 
and Fairbanks, Alaska, which will receive this 
telemetry, transmitting it to Central Control at 
Goddard Space Flight Center in Greenbelt, 
Maryland, at a data rate of up to 64 KC. 

General Description of the 
Programming System 

The requirement of the programming system 
for OGO was to provide quick-look analysis and 
control of the status of the spacecraft and se- 
lective experiments on board the satellite. (The 
tracking and orbit determination of OGO is not 
a function of this program.) All telemetry is 
recorded for further extensive analysis in non- 
real time on other equipment. There are several 
special purpose consoles attached to the com- 
puter to accept telemetry, and provide a means 
of communication to and from the spacecraft. 
These are appropriately called : PCM Telemetry 
input equipment. Control and Display Console, 
and Command Console. It is through the inte- 
gration of these consoles with the computer pro- 
gram that experiments on board the satellite 
may be selected to start, or terminate, by a com- 


mand from the computer. The selective analysis 
of those experiments transmitting data is ini- 
tiated by an input request from the Control and 
Display Console and selective spacecraft status 
analysis is handled in the same way. It is there- 
fore a requirement of the computer program 
that it be flexible enough to handle these many 
and various requests instantaneously. Since the 
bit rate of 64 KC means a new telemetry frame 
of data will arrive every 18 ms., the computer 
program.s must be able to m.ake maximum use 
of the capabilities of the computer. 

The SDS 920 computer was chosen by NASA 
to handle the quick-look analysis and control 
for OGO. The 920 is a small-scale computer 
with a 24-bit word and an 8 us. cycle time. It 
has one index register and uses single-address 
fixed-point arithmetic logic. It is capable of 
handling many kinds of input/output, to in- 
clude: magnetic tape, paper tape, punched 
cards, on-line typewriter, and high speed 
printer. The OGO 920 has 8 K of memory and 
32 channels of interrupts for input/output use. 
The OGO installation is illustrated in Figure 1. 

Functions of the Real Time 
Monitor Control System 

The programming system for OGO can be 
thought of in two distinct parts : the Real Time 
Monitor Control, and Experiment Processors. 
All routines necessary to connect the computer 
with the external environment are integrated 
into the Real Time Monitor Control system. It 
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Figure 1. OGO Central Control Installation. 

must service all interrupts, provide for the re- 
ceipt of all input including telemetry and man- 
ual input requests from the consoles, direct and 
send out all output to the appropriate receiving 
device, keep track of time, and determine the 
sequence in which all functions and processing 
is conducted. The Monitor Control does not 
process any of the data itself, but rather acts 
as a general purpose framework in which selec- 
tive routines perform the processing. These 
selective routines are called Experiment Proc- 
essors and are independent routines operating 
under control of the Monitor. These routines 
process telemetry input data, returning to 
Monitor formatted output for driving digital 
displays on the special purpose consoles, for 
printing, and for typing, as well as other infor- 
mation to the Monitor. 

The first distinct part of the programming 
system for OGO, the Monitor Control, is itself 
divided into three sections : the Schedule 
Program, Monitor Processors, and Interrupt 
Processors. The following is a definition of the 
functions and characteristics of these various 
sections. 

Monitor Schedule Program 

The Monitor Schedule Program consists of 
routines which collectively coordinate, super- 
vise, and schedule all processors in the system, 


utilizing a priority table of processors. It saves 
the status of the machine whenever an interrupt 
occurs, and facilitates a proper return from an 
interrupt by restoring the condition of the ma- 
chine at the point of interrupt. The heart of 
this scheme is the Monitor Schedule Routine 
whose function is to examine sequentially the 
entries in the priority table in order to deter- 
mine the next routine of highest priority to be 
processed. 

The priority table of processors consists of a 
group of words, or module, which identify and 
describe each processor with a reference in the 
table. Due to the difference of functions of the 
various processors, the modular approach allows 
for flexibility in assigning priority during the 
debugging stage of the system. By chaining 
these modules together, say by the first word of 
each module referencing the first word of the 
next module, the modules may be of various 
length (Fig. 2). 

In order for the Schedule Routine to carry 
out its function of determining the next proc- 
essor of highest priority with something to do, 
it must know the status of each processor in the 
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Figure 2. Make Up of First Word of the Modules. 
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priority table at any given instant of time. This 
is accomplished through the use of indicator 
bits in the module, say the left portion of the 
first word of each module (Fig. 2). The vari- 
ous states that a processor may be in are the 
following : 


‘in process’ —This processor was inter- 
rupted while it was process- 
ing, and entry should be made 
at the point of interrupt. 

‘ready’ —this processor now has some- 
thing to do; entry should be 
made at the beginning. 

‘suppressed’ — this processor should not be 
processed at this time. 

‘terminated’ — this processor has now com- 
pleted its processing. 

‘noit in core’ — ^this module is available for the 
addition of a new processor. 


These indicator bits, then, inform the Sched- 
ule Routine whether or not the referenced proc- 
essor is now available for processing. Since the 
priority table entries must be examined from 
the top each time return is made to the Sched- 
ule Routine, a rapid method of examining the 
indicator bits of each module was needed. To 
clarify the method that was devised, it is neces- 
sary to diverge for a moment and describe two 
uses of an instruction in the SDS 920 repertoire, 
which may be used to load the index register. 


A 


A 

B 


EAX A 
X Reg. = A 


EAX* A 
PZE 1 
X Reg. =: 1 


Immediate Addressing, 
where the address field of 
the instruction itself is 
placed in the index register. 

Direct Addressing, where 
the contents of location A, 
referenced in the address 
field of the instruction, is 
placed in the index register. 


EAX* A 
PZE* B 
PZE 2 
X Reg. = 2 


Indirect Addressing, where, 
if the contents of location 
A is in turn indirectly ad- 
dressed, the address field of 
A will again be used as the 
location from which to load 
the index register. 


The SDS 920 computer has the capability of 
unlimited indirect addressing ; that is, it is pos- 
sible to continue the above indirect addressing 


of an instruction for many steps, until a loca- 
tion is encountered that is not itself indirectly 
addressed. It is convenient then to choose the 
indicator bits of the modules discreetly to take 
advantage of this feature of the machine as 
well as the fact that the modules are chained 
together. If all states of a processor which indi- 
cate that it not now available for processing 
contain the indirect addressing bit as one of the 
indicator bits, then only those routines with 
something to do at the current time would be 
picked up for examination. For instance, the 
processor states of ‘suppression,’ ‘termination,’ 
and ‘not in core’ all result in there being noth- 
ing to do at this time on this particular proces- 
sor. 

In Figure 3, the consequences of giving an 
“EAX* A” would be that modules A, B, and C 
would be skipped and the first word of module 
D would be placed in the index register. For 
module A is ‘suppressed,’ module B is ‘not in 
core’ and module C is ‘terminated,” indicating 
that no processing should be done on these 
processors at this time. The contents of the 


index register actually contains the location of 

PRIORITY TABLE 
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Figure 8. Example of the Use of the Indicators. 
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the next module, due to the chaining criteria, 
but this presents no particular problem if ref- 
erence is always made to the words of the 
module in reverse; that is, -1,2 or -6,2 etc. 

Since it costs only one machine cycle to skip 
each successive module whose indirect address- 
ing bit is set, the result is a most efficient search 
of the priority table. The alternative would be 
to pick up the contents of the indicators of each 
module and then through various logical com- 
pare instructions determine whether any of 
these situations are the case. It is apparent that 
the scheme using the indirect addressing fea- 
ture of the computer facilitates spending the 
minimum time examining the entries of the 
rather lengthy priority table of processors. 

In addition to the indicator and chain word, 
each module also contains tjie first word loca- 
tion of the processor which it references, as well 
as a block of locations in which the condition of 
the machine is saved when this processor is 
interrupted. If the referenced processor has 
input/output functions, the module will addi- 
tionally contain the location of the I/O data 
which is stacked in a table for subsequent trans- 
fer or editing. If the processor is an Experi- 
ment Processor, it will also contain the location 
of flag words in the processor by which com- 
munication is made with the Monitor. That is, 
words by which each Experiment Processor can 
signal Monitor of its various output require- 
ments, and whether it has completed processing, 
etc. (Fig. 4). 

The many processors of the Real Time Con- 
trol System may be in various states during the 
course of real time operations so that it is im- 
possible for one processor to communicate di- 
rectly with another. This is especially true of 
I/O data and channel select requests. There- 
fore, two subroutines were devised in the Moni- 
tor Schedule Program to handle the passing of 
data from one processor to another. One sub- 
routine handles the stacking of data onto the 
receiving processor’s stacking buffer, while the 
other handles the unstacking of the next data 
to be processed. The barrel, or wraparound, 
method was chosen to facilitate the stacking 
and unstacking from these buffers. That is, a 
'first in-first out’ list type of scheme. To avoid 
moving the data around in core memory, since 
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Figure 4. Composition of the Modules. 


this is necessarily inefficient, the items which 
are stacked and unstacked in the buffers are the 
locations of the I/O data or channel select 
requests. 

Each I/O processor’s module, then, contains 
the top and bottom locations of the stacking 
buffer for that processor. It can be seen that 
these buffers may be variable in length from one 
processor to the next since each processor’s 
buffer is uniquely defined in its module. The 
module also contains the location of the first 
item to be unstacked as well as the next free 
location into which to stack. These locations 
may vary anywhere within the prescribed stack- 
ing buffers area, hence the name ‘barrel method’ 
(Fig. 5). 

When an item is unstacked, the ‘first’ loca- 
tion reference in the module is replaced by the 
‘first -)- 1’ location reference so that the next 
item is now referenced for unstackiiig. Simi- 
larly, when an item is stacked in a buffer at the 
‘next’ location, the ‘next’ location reference is 
then updated to the ‘next -j- 1’ location as the 
next free location in which to stack. The top 
and bottom locations, delineating the total 
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Figure 5. ‘Barrel-Like’ Composition of Stacking Buifers. 

buffer area, can be thought of as being only one 
unit apart, for the subroutines always treat the 
bottom location -f 1 as if it were the top loca- 
tion. The circular action has now been effected. 

Monitor Processors 

The Monitor Processors handle the input/ 
output requirements of the Real Time Monitor 
Control, as well as a few other minor functions. 
The SDS 920 has two buffered channels for 
input and output data transfer and 32 real time 
channels, all of which have interrupt capabili- 
ties. One of the buffered channels is used ex- 
clusively for the telemetry input, the other 
handles all other I/O functions, except those 
from the special purpose consoles, time inter- 
rupts, etc. Due to the difference in the rates of 
speed of the devices attached to this I/O chan- 
nel, it was decided to have a series of processors, 
each one associated with only one I/O device, 
rather than a single large I/O processor geared 
to the buffered channel. Each of these proces- 
sors, like all other processors, is under control 
of the Monitor Schedule Program. Its priority 
order is determined by the position of its 
module in the priority table and is in direct 
relationship with the speed of the I/O device 
to which it refers. For example, the priority of 
a processor handling the on-line printing is 
higher than the one handling the output mes- 
sages to the typewriter. 

Since only one channel select request can be 
executed by a buffered channel at a time, all 
of the Monitor I/O Processors hand off their 
I/O select requests to a single processor, rather 
than randomly selecting the channel themselves. 
The function of this processor is to select the 
channel with whatever request is next in line, 
regardless of the kind of I/O device indicated. 


This processor is guaranteed that the channel is 
ready to receive the select request, for as each 
request is serviced, this processor ‘suppresses’ 
itself. That is, it sets the indicator bits in its 
own module so that it will not be considered for 
processing again until it is ‘unsuppressed.’ 
When the interrupt from the completion of the 
data transfer on that channel occurs, the inter- 
rupt processor servicing this interrupt will 
‘unsuppress’ the channel selecting processor so 
that it may now continue with the next I/O 
request, since the channel is free to be rese- 
lected. This channel selecting processor has 
the highest priority of all the other Monitor 
Processors, thus allowing for the maximum use 
of the channel at all times. 

There are other Monitor Processors which 
handle the input/output to and from the special 
purpose Control and Display Console and the 
Command Console. These processors are con- 
trolled and executed in a straightforward man- 
ner since each function is handled by a unique 
interrupt through the 32 real time channels. 
There are eight such processors connected with 
these consoles. One additional Monitor Proc- 
essor handles time in the system. It maintains 
a gross time figure in minutes, and prints out 
a GMT time message periodically. It is espe- 
cially used to flag certain processors dependent 
on some interval of time that it is time to com- 
mence processing. This procedure is called 
‘readying’ a processor. 

Interrupt Processors 

The third category of the Monitor Control 
deals with processors which handle interrupts. 
Each Interrupt Processor is coordinated with a 
separate and unique interrupt channel. The 32 
channels of interrupts have a priority system 
of their own in that lower level channels may 
be interrupted by any higher level channel. 
This means that the corresponding Interrupt 
Processors also may be interrupted during the 
course of their processing if an interrupt of a 
higher priority comes in. Therefore, the Inter- 
rupt Processors are naturally coordinated with 
the demands of the interrupt to be processed as 
well as with the priority level of the channel 
selected to handle that interrupt. For instance, 
it was decided that the telemetry interrupt 
would be on the channel with the highest pri- 
ority, since a new word of telemetry comes into 
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the computer every 280 us. The one second 
time interrupts were given the second highest 
priority, and so on. 

Since all processors, whether Monitor Proc- 
essors, Interrupt Processors, or Experiment 
Processors, return control upon completion to 
the Monitor Schedule Routine, it was necessary 
to vary this requirement somewhat in the case 
of Interrupt Processors. For if an Interrupt 
Processor is itself interrupted, then return 
should be made to the point of interrupt. For 
interrupts are a demand to acknowledge some 
I/O data transfer and must be processed before 
the normal processing continues. This allows 
for smooth and coordinated interaction be- 
tween the different parts of the Real Time Mon- 
itor Control system. The technique of drawing 
a hypothetical line in memory was used to 
allow an Interrupt Processor to decide quickly 
whether it had interrupted out of another In- 
terrupt Processor, for all Interrupt Processors 
are placed above this 'line.’ If this is the case, 
return is made to the interrupted Interrupt 
Processor and not to the Schedule Routine. It 
logically follows that when all Interrupt Proc- 
essors have finished, the one which caused the 
first interrupt in time will be the last to finish 
and will return control to the Schedule Routine. 
One of the advantages of this scheme is that it 
is not necessary for the Interrupt Processors 
to run in a ‘disabled’ mode. That is, where the 
computer is prevented from receiving another 
interrupt by executing a special instruction. In 
fact, the ‘disable’ instruction is given rarely in 
this real time system, and when it is u^ed, the 
duration of the disabled mode is very short. 
For, since a new telemetry word comes into the 
computer every 280 us., it must not be disabled 
for a period of time equal to or greater than 
that. 

Experiment Processors 

The second distinct part of the programming 
system for OGO consists of all those selectable, 
independent routines called, collectively. Ex- 
periment Processors. The function of these 
Experiment Processors is to perform analysis 
on the telemetry input data, both of the experi- 
ments on board the satellite and the status of 
the spacecraft itself. These processors operate 
under control of the Monitor Control programs, 
returning, upon completion of processing each 


telemetry frame of data, to the Monitor Sched- 
ule Routine. Since there are many more Ex- 
periment Processors than would fit into core 
memory along with the permanent Monitor 
Control, it is the function of the Monitor to be 
able to add and delete selective Experiment 
Processors without disturbing the real time 
processing. A Monitor Processor, called the 
Real Time Load Processor, performs this func- 
tion. It is this real time load capability of the 
Monitor Control which affords the flexibility of 
operation required of the system. The Monitor 
Control can thus provide for maximum use of 
memory capacity as well as maximum use of 
processing time. 

Requirements of Experiment Processors 

All Experiment Processors place three gen- 
eral requirements on the Monitor. They must 
be able to obtain all data needed from the Moni- 
tor, including such parameters as time and 
orbital position of the satellite, as well as spe- 
cific data points from the telemetry frame. They 
must also be able to return the results of their 
processing back to the Monitor in the form of 
messages to be output on the on-line printer. 
Finally, they must be able to communicate with 
the Monitor that they have completed their 
processing or to request that special functions 
be accomplished, such as sending commands to 
the special purpose consoles. It is the function 
of the Real Time Load Processor to provide 
the capability of bringing such an Experiment 
Processor into memory when requested by de- 
pressing the appropriate button on the Control 
and Display Console, and to make all necessary 
connections and links between the processor 
and Monitor. 

Coding Rules 

A problem arose in the fact that these Experi- 
ment Processors were to be written by many 
different programmers. This would appear to 
involve a considerable amount of supervision of 
the coding used to protect the system from 
‘blow ups,’ but actually it was possible to han- 
dle this checking automatically by writing a 
service program called the Static Checker. As 
its name implies, the Static Checker checks the 
symbolic coding of an Experiment Processor to 
determine whether the programmer has com- 
plied with the various coding rules laid down 
for the system. 
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The rules to be followed by a programmer 
when writing an Experiment Processor are of 
two types: real time restrictions, which are 
necessary to protect the Monitor Control; and 
relocatable binary restrictions (since the proc- 
essor will be used in this form), which follow 
from the nature of relocation itself. The rules 
are ; 

Real Time Restrictions 

1. No I/O instructions 

2. No breakpoint switch tests 

3. No halts 

Relocatable Binary Restrictions 

1. No addresses of type : — Symbolic, Sym- 
bolic + Symbolic, or use of * or / 

2. No references to absolute memory loca- 
tions 

3. No negative constants or masks 

The real time restrictions are quite straight- 
forward, since all I/O is handled by the Moni- 
tor; breakpoint switches (sense switches) are 
reserved for Monitor use, and halts are strictly 
anathema in real time work. 



Figure 6. Example of Static Checker Diagnostics. 


this purpose, and this is, in fact, the scheme 
commonly used on the 920. 

For data words, however, the full 24 bits are 
used and it is for this reason that Experiment 
Processors needing negative constants or masks 
with a left-most one must generate them. This 
is the only real coding restriction placed on the 
programmer and in practice is not serious, 
since the 920 has the capability of forming both 
I's and 2’s complements with single instruc- 
tions. 

The Static Checker accepts a symbolic deck 
and produces a listing containing error diag- 
nostics, if any (Fig. 6). 


The first set of relocatable binary restrictions 
are the familiar ones inherent in relocation and 
the second restriction, forbidding absolute ad- 
dresses, is also fairly standard in this kind of 
work. The third restriction, however, is some- 
what unusual and arises from the unique man- 
ner in which relocation is handled in the SDS 
920. A brief look at the makeup of an SDS 920 
instruction word will serve to illustrate this. 

A 24-bit instruction word in the 920 is 
broken down as follows : 


|r 

X 

0 


A 


0 1 2 8 9 10 23 Bits 

The fields are, from right to left, a 14-bit 
address field, one bit for indirect addressing, a 
7-bit operation field, one bit for indexing, and 
finally, the left-most bit which is not inter- 
preted when the instruction is executed. That 
is, the instructions 0 35 01000 and 4 35 01000 
are interpreted by the computer as being iden- 
tical. Since the address field is the only one that 
need be relocated in any instruction, the left- 
most, or sign bit of the word may be used for 


Formatting 

It was apparent that the most straightfor- 
ward way to maintain the Experiment Proc- 
essors for use with the real time system was in 
the form of a library on magnetic tape. In line 
with this, two fundamental decisions were made 
concerning the physical makeup of these proc- 
essors, both related to the efficient use of the 
computer’s memory at execution time: they 
were to be in relocatable binary (as previously 
mentioned) and they were to be broken up into 
uniform-length blocks. Since it was considered 
unreasonable to require programmers to code 
their Experiment Processors in this latter man- 
ner, it was necessary to devise a method to take 
the programmer’s processor, coded in a single 
sequence, and convert it to the desired form 
automatically. 

In addition to checking the Experiment 
Processor, this conversion to uniform-length 
blocks is also provided by the Static Checker 
during the same run. This was easily facili- 
tated, since the Experiment Processor is input 
to the Static Checker in symbolic form and the 
necessary additional instructions and pseudo- 
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Figure 7. Conversion to Uniform-Length Blocks by Static Checker. 


operations may be inserted in that form (Fig. 
7). 

It will be noted that a few extra things have 
been inserted as well. For instance, the Experi- 
ment Processor obtains telemetry and other 
data from the Monitor by referencing cells 
which are absolutely located in the Monitor. 
Therefore, these locations are defined by BOOL 
pseudo-operations inserted by the Static 
Checker; the programmer need only refer to 
them symbolically. The Prelink and Postlink, 
which are bookkeeping linkages to the Monitor, 
are also inserted at this time. 

The breaking up of the consecutive sequence 
into uniform-length blocks is eifected by in- 
serting an ORG (origin definition) pseudo- 


operation at fixed intervals. The actual con- 
nection between one block and the next is han- 
dled by a ‘dynamic link,’ the BRU (uncondi- 
tional branch) instruction which is inserted at 
the end of each block. The advantage to be 
gained by this approach is that the blocks of a 
processor need not be contiguous in memory, 
and in practice this is usually the case. ORG’s 
are used to define the start of each block in 
order to ensure that the result of assembling the 
symbolic output of the Static Checker is a series 
of paper tape records of uniform length, sim- 
plifying the maintenance of the library of Ex- 
periment Processors on magnetic tape. 

The problem of pseudo-operations which de- 
fine more than one memory location, such as : 


DEC 1,2,3 Places decimal 1-3 in 3 successive locations 

OCT 4,5,6,7,10 Places octal 4-10 in 5 successive locations 

BCI 4, ALPHA FIELD Places BCD characters in 4 successive locations 

BSS 20 Reserves 20 successive locations. 
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is handled by a special diagnostic in the Static 
Checker which flags any such pseudo-operation 
which causes the ‘straddling’ of two blocks as a 
coding error. The programmer must then re- 
arrange his processor so that the locations de- 
fined will fit into one block. 

Other Service Programs 

The Static Checker has a companion program, 
the Dynamic Checker, which works with an 
Experiment Processor in Machine Language: 
the result of assembling the output of the Static 
Checker. The Dynamic Checker is essentially a 
simulator which provides the opportunity to 
debug an Experiment Processor under condi- 
tions as similar as possible to those of the real 
time Monitor. The Experiment Processor is 
embedded in the Dynamic Checker and is ‘fed’ 
telemetry frames via the same communication 
cells used in the real time case. The output of 
the processor is printed for subsequent analysis 
by the programmer. 

The Dynamic Checker uses for input a mag- 
netic tape containing simulated telemetry 
frames. Its modus operand! is to read a frame, 
branch to the Experiment Processor, and check 
to processor’s communication cells for com- 
mands and/or output, repeating this cycle until 
the tape end-of-file is reached. The input tape 
is generated by another service program, the 
Tape Builder, which contains a pseudo-random 
number generator and which, essentially, gen- 
erates a magnetic tape of simulated telemetry 
frames containing nothing but white noise. This 
is to check the Experiment Processors for one 
of their basic requirements : that they be 
capable of accepting any data and still not 
‘blow up.’ The Tape Builder, however, has the 
provision of accepting programmer-coded sub- 
routines which allows the data generated by 
these subroutines to be inserted in specified 
words in the telemetry frames. In this manner, 
the programmer may insert realistic data in the 
words of the telemetry frame which are ana- 
lyzed by his Experiment Processor and thus 
may use the Dynamic Checker as a diagnostic 
tool. 

Like the Static Checker, the Dynamic Checker 
has a second function in addition to a debugging 
one. This is the function of timing the Experi- 
ment Processor. One of the two interval timers 


available on the SDS 920 is used for this pur- 
pose; the result being that, at the end of the 
run, the Dynamic Checker prints out the worst- 
case (longest) execution time in machine cycles 
for the Experiment Processor being tested. As 
will be seen later, the two parameters : number 
of blocks (given by the Static Checker) and 
worst-case execution time (given by the Dy- 
namic Checker) are the ones which will deter- 
mine whether this particular processor may be 
loaded and executed at any particular time when 
the real time system is running. 

In addition to the above service programs, 
there is, of course, a Tape Librarian program 
whose task is to maintain the library tape of 
Experiment Processors; adding, deleting, or 
replacing processors as appropriate. In addi- 
tion, the Tape Librarian includes a computed 
checksum with each block written on the li- 
brary tape. These service programs taken to- 
gether provide a means of carrying an Experi- 
ment Processor from the coding stage through 
inclusion on the library tape in a fairly auto- 
matic manner (Fig. 8). 

Loading Experiment Processors in Real Time 

The Monitor has two^ restraints which deter- 
mine whether an Experiment Processor may be 
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Figure 8. Flow of Experiment Processor from 
Coding to Library Tape. 
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added at any given time: whether there is 
room in memory and whether there is enough 
time to execute it. The portion of memory avail- 
able for use by Experiment Processors, is called 
free storage and is allocated in blocks of the 
same size as the subdivisions of these processors. 
The amount of time available for Experiment 
Processor execution is called slack time and is 
the number of machine cycles presently not 
being used in the 18 ms. interval between suc- 
cessive frames of telemetry. A parameter table 
is maintained for use by the Real Time Load 
Processor containing one word for each proc- 
essor on the library tape. Each parameter 
word contains two fields which specify the 
amount of memory (in blocks) and amount of 
time (in machine cycles) required by that par- 
ticular Experiment Processor. When a request 
is entered into the computer to add a particular 
Experiment Processor, this request is handed 
off to the Real Time Load Processor which may 
easily check whether there is enough free stor- 
age and slack time to accommodate this Experi- 
ment Processor. If not, the request is placed in 
a special stack called the Request Stack. If the 
processor can be added, however, the Load 
Processor positions the library tape, reads the 
Experiment Processor into memory, relocates 
it, and connects it to the Monitor. Actually, the 
Load Processor makes requests to the Monitor 
for every tape operation given and it is the 
interrupt that occurs after each record is 
skipped or read which causes the Load Proc- 
essor to be entered again. (This includes back- 
spacing and rereading in case of a checksum 
error.) 

If the request given to the Load Processor 
is to delete an Experiment Processor, either be- 
cause this request has been made from the con- 
sole or because the processor has signalled the 
Monitor that it has finished its run, then the 
Load Processor locates this processor, disen- 
gages it from the Monitor, returns the proc- 
essor’s blocks to free storage, and adds the 
processor’s execution time to the slack time 
count. The Load Processor then checks its Re- 
quest Stack to see if there are any requests to 
add processors which now may be accommo- 
dated. 

The above description is considerably ab- 
breviated and does not cover some of the special 


considerations of the Load Processor as actually 
written, such as error checking, backspacing, 
and rereading due to checksum error, etc. Also, 
although it is convenient to speak of ‘picking 
up’ blocks of free storage and ‘returning’ them 
to free storage, nothing is moved around in 
memory the change is actually one of alloca- 
tions. 

The basic pieces of information the Load 
Processor works with are four in number (Fig. 
9). PAR AM is a parameter table giving the 
storage and time requirement for every Experi- 
ment Processor on the library tape. The block 
count is in the operation field and the execution 
time is in the address field. Thus, as illustrated. 
Experiment Processor 34 requires 3 blocks of 
storage and has a worst-case execution time of 
700s cycles. 

Figure 9 also shows SLACK, which is a cell 
containing the available slack time in cycles, 
1300s here. FREE is a cell which refers to Free 
Storage, the fourth element. It can be seen that 
an elementary list processing technique has 
been employed here. The address of the first 
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Figure 9. Elements Used by the 
Monitor Load Processor. 
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word in each block in free storage contains the 
starting location of the next so that they are 
chained together with the address of FREE 
giving the starting location of the end block in 
the chain. Furthermore, from bottom to top 
in the chain, the operation field of the first word 
in each block contains a serial count, starting 
at zero. This count is also carried over to the 
operation field of FREE and thus this cell not 
only provides a link to all of the blocks in free 
storage but also contains the number of blocks 
in the chain. The linking of blocks shown is 
somewhat idealized. In actual practice, these 
links become quite ‘scrambled.' 

Under this scheme, the process of testing 
whether a requested Experiment Processor may 
be added is handled quite easily, simply by using 
its parameter word from the table PARAM: 
the address field of the parameter word (execu- 
tion time) must be less than or equal to the 
address field of SLACK, and the operation field 
of the parameter word (block count) must be 
less than or equal to the operation field of 
FREE. 


FREE STORAGE 



Figure 10b. ‘Removing’ Blocks from Free Storage — 
Second Step. 


FREE STORAGE FREE STORAGE 



Figure 10a. ‘Removing’ Blocks from Free Storage- 
First Step. 


Figure 10c. ‘Removing’ Blocks from Free Storage — 
Third Step. 
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Figure lOd. 'Removing^ Blocks from Free Storage — 
Final Condition. 

Adding an Experiment Processor 

In order to add an Experiment Processor, 
blocks must be ‘removed' from free storage so 
that the processor may be read into them (Fig. 
10). The ‘removal' of a block is a two-step 
process, in which the address of FREE is 
stacked in a table called START and the con- 
tents of the location given by that address (the 
starting location of the block) is then stored in 
FREE. FREE now refers to the next block in 
the chain, and this process may be continued as 
often as necessary. This is, of course, the fa- 
miliar ‘popping up' of list processing and here 
it is done three times since Processor 34's pa- 
rameter word calls for three blocks. The final 
step is to subtract the execution time in the 
processor's parameter word from SLACK, set- 
ting it to the unused time now available. 

The table START is used for two things. 
First it is used to specify the blocks of memory 
into which the Experiment Processor is to be 
read. START is also used for relocation with a 
companion table BITS which is generated to 
contain successive multiples of the block length, 
starting at 0. Since the Experiment Processors 
are assembled with a base origin of zero, it can 
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Figure 11. Example of Relocation. 


be seen that BITS consists of the unrelocated 
starting location of each block. Figure 11 shows 
how this relocation is carried out on a sample 
instruction. 

The method of relocation allows an instruc- 
tion in any block to reference a location in any 
other block occupied by that processor. The 
dynamic link between blocks could then be 
effected by the final BRU *+1 in each block, 
simply by flagging these instructions for relo- 
cation. In practice, however, the Tape Librarian 
program replaces these terminal branches with 
computed checksums when an Experiment 
Processor is added to the tape library. The 
Load Processor uses these checksums to deter- 
mine whether the Experiment Processor has 
been loaded properly, with the usual re-reads in 
case of failure. When the processor has been 
successfully read in, the relocation sequence in 
addition replaces the checksums with the origi- 
nal branches, connecting each block to its suc- 
cessor and the last block to the Monitor. 

It might be pointed out that there is no fixed 
requirement that the blocks have a length 2008 
nor, in fact, need their length be a power of 
two, although this simplifies the relocation 
scheme. This figure was chosen as a convenient 
length and may be changed if experience with 
the system warrants it. 

Deleting an Experiment Processor 

Figure 12 shows how the blocks of the Ex- 
periment Processor added in the example above 
are ‘returned' to free storage. The starting lo- 
cation of the processor is obtained from the 
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Figure 12b. ‘Returning’ Blocks to Free Storage — 
Second Step. 

module in the priority table to which it was 
assigned and placed in a working cell, called 
DUMMY here. Since the terminal branch at 
the end of every block provides a link to the 
next block, this ‘pushing down' process can be 
carried out in a four-step process analogous to 
the one in which blocks were ‘removed' from 
free storage. The block count is updated in the 


Figure 12d. ‘Returning’ Blocks to Free Storage — 
Final Condition. 

process so that at completion FREE again con- 
tains the correct block count when the number 
of blocks given by the processor's parameter 
word are ‘returned' to free storage. Finally, the 
processor's execution time is added to SLACK 
to set it to the unused free time now available. 

As shown by these examples, program blocks 
are never actually moved in memory; they are 
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simply read into the first available blocks and, 
since the blocks are dynamically chained with 
branch instructions, these blocks need not be 
contiguous in memory. A processor is deleted 
essentially by disconnecting it from the Moni- 
tor and allowing the storage it occupies to be 
reused. Since the Tape Librarian program con- 
verts all block reservations in an Experiment 
Processor to the corresponding number of zero 
words when the processor is added to the li- 
brary tape, these memory^ blocks need never be 
cleared. 

Concliision 

We feel this project is of interest since it 
represents an unusual situation in data process- 
ing. This is due to the fact that, while there was 


a requirement to selectively process different 
data points on demand, there was not a need to 
process all of the data available. Given the 
selection of the specific computer, however, this 
project attempted to optimize its use by pro- 
viding the capability of doing the maximum use- 
ful work that memory space and processing 
time will allow. In this paper we have pointed 
out techniques to minimize the execution time 
of a powerful Real Time Monitor and to allo- 
cate reusable storage in a flexible and efficient 
manner. While using the computer to sample 
and monitor data is not a typical data process- 
ing application, it is hoped that this paper has 
indicated the efficiencies which we feel are in- 
herent in the real time approach. 
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PART ONE— NEED FOR A NEW ETHOS 
I. Introduction 

“Acceptability,” said John Kenneth Gal- 
braith, “is the hallmark of the conventional 
wisdom/’i 

Since, however, the pronouncement of the 
conventional wisdom is the prerogative of those 
in eminent public, academic, business or labor 
positions, I am not privileged to bore you with 
a recital of the conventional wisdom, and — 
even if I could do so — ^to entertain you by ex- 
pounding it at a properly sophisticated level. 
I must, therefore, look to that arch enemy of 
the conventional wisdom — ^the march of events. 

If ever a period in the history of man de- 
manded radical — I am using ‘radical,’ derived 
from radix, root, in its original sense, namely, 
going to the roots — fundamental wisdom, it is 
surely this revolutionary period of transition to 
a new era — ^the age of cyberculture — the new 
era that is formed by a science, cybernetics, 
born barely a quarter of a century ago, and a 
technology that, for all its precocious develop- 
ment, has barely left the cradle. 

Most of us in this room were probably proud 
midwives assisting in the delivery of the com- 
puting machine only a decade or so ago. 

Since then, the world has, changed radically. 
Three powerful new phenomena have preco- 
ciously reached their vigorous, boisterous ad- 


olescence — long before the world is prepared for 
the scientific-social-technological-economic-cul- 
tural revolution that has been unleashed. Those 
in the center of any revolution are always the 
least disturbed. The hub of a wheel is fairly 
stable, the eye of a hurricance is calm, and 
those who create the concepts and forge the 
twls of complex social revolutions are neither 
alarmed by the enormous power of their brain- 
children, nor are they surprised. 

Never has a powerful and complete revolu- 
tion developed more quickly than this cyber- 
cultural revolution that is affecting the lives of 
millions of human beings who have never even 
heard the new words to describe powerful new 
concepts. In fact, things have been happening 
so fast that even those who know a great deal 
about one of the phenomena have not had time 
to learn enough about the others — or about the 
world they are changing. 

First among the new phenomena is nuclear 
science. Introduced to a stunned world in its 
least attractive manifestation, nuclear science 
holds untold mysteries, unimaginable terror, 
and vast promises. Einstein said, when the 
atom was split everything changed except our 
thinking. Far too many people still think of 
thermo-nuclear bombs as super-slingshots 
Others realize that nuclear science might pro- 
vide the vast reservoir of physical energy we 
need to produce abundance for all mankind. We 


Cyberculture is composed of “cybernetics,” the science of control, and “culture,” the way of life of a society. 
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have great hopes for atoms for peace and must 
search for a way to use atoms for people. 

The second of the powerful new phenomena 
is not clearly focused yet, although a demon- 
stration of the destructive potential of nuclear 
science has shocked the entire world to see, at 
least as a vague vision, the new concept : peace 
as a positive phenomenon, a valuable and work- 
able instrument to settle human conflicts. That 
is quite different from man’s past experience 
for since the beginning of history mankind has 
known as an alternative to war only the comple- 
ment of war, an interlude between wars, oc- 
casionally even a reasonably prolonged absence 
of war. Even when there was no fighting, war 
has been regarded as the normal and accepted 
means to settle conflicts. Contrary to popular 
opinion, it is not a foregone conclusion that 
peace will bring about the millenium. There is 
no reason to believe that conflicts will disap- 
pear. And to use peace, rather than war, as an 
instrument to settle conflicts will require more 
ingenuity and intelligence and skill than to 
devise means to win wars. Difficult though it 
may be to live with, peace is the essential condi- 
tion, if human civilization is to survive at all. 

The least known and most far-reaching new 
phenomenon is the science of cybernetics and 
the revolutionary technology based upon its dis- 
coveries. Automatic systems and computing 
machines, even in their infancy, have an impact 
upon our world that could not have been 
imagined two decades ago; and they have the 
clearly foreseeable potential to produce not only 
unprecedented abundance for human beings, 
but relieve man forever of drudgery and toil. 
Yet, even experts still look at the computing 
machine as a super-abacus. 

Any effort to deduce how observable phe- 
nomena are likely to develop and affect the en- 
vironment involves some arbitrary assumptions 
that must be defined and granted. The major 
assumption in my hypothesis is so fundamental 
that, should it prove to be unreasonable, nothing 
on earth is likely to be proved or disproved 
again. I assume that the cold war will not be 
escalated into the nuclear fission of the earth, 
but that, on the contrary, it will continue to 
defrost. I further assume that all of us in the 
field of data processing and automation will 


continue to do our jobs with as much ingenuity 
and enthusiasm as we have in the past and to 
develop our precocious brainchildren, as we 
have every reason to expect from our auspi- 
cious start. 

As we know, there is a great deal of confusion 
in the public mind about the words “automatic” 
and “automation,” and about the effects of these 
rarely recognized phenomena. Economic pun- 
dits have made solemn pronouncements about 
the future impact of “automation” and based 
their predictions firmly upon a past experience 
with mechanization and its impact upon em- 
ployment and the Gross National Product. A 
few months ago. Secretary of Labor Wirtz esti- 
mated that automatic systems have reached the 
intellectual level of human high-school gradu- 
ates.2 

Monumental fallacies are incorporated into 
such statements because the basic premises used 
by economic pundits and by the Secretary of 
Labor are incorrect; they confuse automation 
with sophisticated mechanization and use these 
basically incomparable phenomena interchange- 
ably. If they could realize that the most so- 
phisticated and efficient mechanical system — no 
matter how many electrical components are in- 
corporated — is an open system that cannot 
operate unless the control loop is closed by a 
human being who must become part of the 
system, whereas an automatic system is a closed 
system in which the human component has been 
supplanted by a computing machine, they would 
understand that the conventional methods to 
inoculate the economy against periodic epi- 
demics of unemployment and slackness are no 
longer relevant. 

Before the Congress has been able to accept 
the Conventional Wisdom of one generation 
ago, everything changed. Everything but our 
thinking! And I must quote again John Ken- 
neth Galbraith, who wrote that “the shortcom- 
ings of economics are not original error, but 
uncorrected obsolescence.” We rightly cherish 
our intellectual heritage, but we must not allow 
it to calcify. The economic-political and social 
wisdom humanity may have acquired so pain- 
fully in the past must be tempered with new 
insights and forever re-evaluated with an open 
mind, just as the scientific and technical herit- 
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age of the past is constantly re-examined and 
re-vitalized by new discoveries and inventions. 

II. Agriculture and Cyberculture 

The present cybercultural revolution is com- 
parable in magnitude only to the agricultural 
revolution, the ferment out of which all civili- 
zation arose. The agricultural revolution 
changed the earth from a jungle into a garden 
where food gatherers became food producers — 
who plant and harvest, who create a surplus 
over their needs and thus build civilizations. 
With the agricultural revolution man first be- 
gan to emerge into humanity. He learned to 
control his environment, to adapt it to his 
needs, and to arrange his life into social pat- 
terns. The agricultural revolution that began 
to free man for his specifically human task 
changed the very nature of man. 

Every society in the age of agriculture goes 
through recurring cycles of scarcity and sur- 
plus, of leisure and drudgery. For centuries 
this has been the human situation : part drudg- 
ery, part creative endeavor ; part scarcity, part 
waste. The cybercultural revolution can create 
a world where machine systems produce un- 
dreamed-of abundance, and where human be- 
ings live human lives and are free to pursue as 
yet undefined human tasks. 

Man in the Stone Age knew his task was to 
find food for himself and his young and to pro- 
tect them from the dangers of a hostile world. 
He carved images on the walls of his cave, and 
sometimes there must have been a genius who 
observed the world closely, who somehow saw 
a pattern in remote incidents. He might have 
noticed that small plants grow into trees; that 
seeds spread by the wind or dropped by birds 
into the earth come forth again as plants, and 
that roots multiply and that some plants grew 
on the same spot again and again. He gathered 
the seeds and put them into the earth himself 
and watched over them and saw them bear fruit. 
And the age of agriculture could be born. 

Man learned to till the earth to produce 
bountifully, to tame animals to help him pull 
the plow, to use the power of water and the 
wind to multiply the strength of his own mus- 
cles. In the course of many centuries man has 
developed complex tools which extend the per- 


ceptiveness of his senses and the skill of his 
hands, and devised powerful machines to extend 
the strength of his muscles. But man alone can 
direct and guide his aids. He must still labor 
for his bread. 

The cybercultural revolution is brought about 
by the invention of devices that supplement the 
labor of man's mind. In the age of cyber- 
culture the plows pull themselves, and the plant- 
ing and harvesting is controlled by tirelessly 
efficient electronic slaves. 

III. What are Human Tasks? 

Man must learn to find new tasks to fill his 
days. If he no longer needs to pull the plow 
and clear the fields and forge the iron, how 
will he tire his muscles to earn his rest? How 
will he use his mind to earn his peace? How 
will he stand upon the earth he has not tilled 
in the sweat of his face, and feel that he is 
its master? What will he do with his life, if 
he no longer has to labor to earn his right to 
live ? ^ 

For centuries, and in every land, men have 
told stories about all-powerful, completely 
obedient slaves who would supply riches and 
ease. The brooms conjured up by the sorcerer's 
apprentice, the genie in the lamp, the monkey's 
paw — ^these are the stories of man's desire for 
a perfect slave and also of his fear. For man 
was always aware of his own inadequacy and 
he was not sure that he could control so per- 
fect a servant with wisdom and with honor. 

We can expect that in the age of cyberculture 
enormous populations will live in leisure. A 
few will '‘work," But no-one will labor in 
drudgery and sweat. This will be technolog- 
ically feasible in a few decades. Invention can 
be speeded with the motivation for perfection. 
During World War II, the invention of radar 
was accelerated — in the opinion of eminent sci- 
entists — by many decades. But cultural lag 
may delay to bring cyberculture to its maturity 
for centuries. Reluctance to change obsolete 
ways of thinking, conflicts of interests, the 
shortsightedness of those who fear what they 
cannot fully understand can delay the future 
and use the best fruits of man's mind for his 
destruction rather than his joy. 
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IV. The Problems of Transition 

The problems of transition from an agricul- 
tural-industrial to a cybercultural society are 
momentous. This is only the beginning. Un- 
employment, serious though it is, is not dis- 
astrously widespread yet. But soon it will 
be, if we refuse to face the fact that unemploy- 
ment cannot be arrseted, even with the most 
phenomenal economic growth rate in the world, 
for the acceleration of automation will always 
exceed the acceleration of the growth rate. 
Unemployment must be changed to leisure. If 
we can learn to live with and use our electronic 
and mechanical slaves, rather than abuse our 
human bodies and our human minds, we can 
solve all the other problems that plague us 
now: the fear of unemployment, the envy the 
poor nations have for the rich nations and the 
fear the latter have of the former, the sus- 
picious competition among the powerful. We 
negotiate about disarmament, but watching the 
unemployment figures rise, we quickly vote 
more money to be spent on producing lethal 
weapons. And as the unemployment monster 
rises, those who are gobbled up most easily — 
the unskilled — ^become afraid and rise in hatred 
and despair. Unskilled Negroes think it is the 
color of their skin that keeps them unem- 
ployed and white men fear that they will have 
to share the labor that is not fit for human 
beings and that none need to do in the age 
of cyberculture. Unions are losing members 
and try to stretch diminishing jobs by divid- 
ing them among more men, instead of enlisting 
as members those whose work can be done by 
machines and teaching them how to live human 
lives. 

The slower the transition from an agricul- 
tural-industrial society to a cybercultural so- 
ciety, the greater is the suffering that must be 
endured, and the smaller the chances that — if 
humanity survives into the next century — the 
emerging age of cyberculture will be a good age 
for human beings. Slow transition does not 
cushion difficulties any more than pulling a 
tooth a little bit at a time softens the pain. 
The difficulties are not caused by the new age, 
but only by the transition itself — so that the 
problem can be solved only as transition is 
accomplished. The best transition is a fast 
transition. If we could have the wisdom to 
introduce as much automation as quickly as it 


is technologically feasible, we could create the 
age of cyberculture in two decades. Slow tran- 
sition would bring such intense and widespread 
suffering that it may break into nuclear war — 
and end all civilization. 

V. Morality and Ethos 

To create the age of cyberculture requires 
something far more difficult than scientific dis- 
coveries and technical inventions. We must re- 
examine our moral values and our ethical con- 
cepts and the deeply ingrained notions to which 
we give lip service. And we must understand 
the difference between the moral values of man- 
kind and the ethos of a society. The sanctity 
of human life, the worth and dignity of the 
individual are moral values that are absolute; 
these always have been true and always will 
be true, as long as there are human beings. 
But the ethos of a society is transient and it 
must alter with the needs of the society. 

What we call our Protestant Ethic, although 
it is much older and spread far wider than 
Protestantism, is the ethos of any society that 
knows scarcity and danger. It is a good ethos 
where virgin forests must be cleared, and wagon 
trains sent across a continent. It is a good 
ethos as long as men must wrest their meager 
fare from the earth with courage and forti- 
tude and perseverance. In such a society, it is 
right that man should labor to plow the fields 
so that he might eat the fruits of the earth 
and bask in the sunshine of the heavens and 
dream under the shade of the trees. “Thou 
shalt eat thy bread in the sweat of thy face'^ 
is a good and reasonable precept in the age of 
agriculture. 

Already the ethos of scarcity is becoming an 
unjust burden. All too often thrift is no longer 
a god, but the graven image of past days to 
which we give lip service. To save one's earn- 
ings and thriftily mend last year's coat, and 
use last year's car, and warm up last night's 
supper no longer is admired. But — ^the ethos 
that commands man to eat his bread in the sweat 
of his face still governs our personal lives and 
our national policies. Although for millions 
of human beings there is no place where they 
can put sweat on their faces, we still believe 
that there can never be another ethos for the 
future than the obsolete ethos of the past. And 
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every year we are condemning more than two 
million human beings to the swelling ranks of 
the unwanted. We suspect them of incom- 
petence and laziness, or we pity them. We 
should re-examine the ethos that condemns 
millions who are simply the first contingent of 
citizens living under cybercultural conditions, 
without any preparation for the new age. 

When human intelligence has invented plows 
that pull themselves, it is more virtuous to 
know how to play and to learn how to live 
for the joy of living than to bemoan the end 
of human toil. 

As sons and daughters of puritans we do 
not know how to play and we look with terror 
at the ‘‘threat’' of unemployment and idleness, 
because we can’t conceive a prom.ise of leisure. 
What we call play, recreation and entertain- 
ment, is not play, but its very antithesis. Play 
is something one does spontaneously, joyfully. 
We rarely do anything just for the joy of doing ; 
but we do a great deal “in order to” gain some- 
thing else. Instead of enjoying a holiday, we 
take a vacation — ^the very word signifies that 
it is merely a void between the activities we 
consider real. The “vacation” is something we 
use “in order to” have more strength for our 
labors. Recreation is something we pursue “in 
order to” re-create our energy. Entertainment 
is “in order to” forget our cares. We eat “in 
order to” replenish our energy. Our children 
are trained for the joyless ethos of scarcity and 
given candy “in order to” do something adults 
consider virtuous. Only the very young are 
fortunate enough to be ignorant of this grim 
purpose and suck their lollypops in blissful 
ignorance and joy. But even the youngest tod- 
dlers are not permitted to play for very long. 
Before they leave the cradle, they are but re- 
quired to manipulate educational toys “in order 
to” learn control of their muscles or “in order 
to” learn to read. By the time they graduate 
from kindergarten we have infected our chil- 
dren and impressed them with our grim ethos. 
The joy of playing for the joy of playing is 
frowned upon. The joy of learning for the sake 
of learning has been destroyed by admonitions 
to learn “in order to” please mother, or to get 
good grades, or to get into Harvard or MIT 
twelve years hence. And by the time they arrive 
in Cambridge, they have not even the faintest 


memory of joy and play, and they grimly labor 
for their “credits,” “in order to” graduate to 
obsolescent jobs. 

VI. Ethos for the Age of Cyberculture 

The proper ethos for the age of cyberculture 
is one that would serve humanity well to build 
a good society. We know so very little about 
living human lives in leisure and abundance, 
in dignity and self-respect, in privacy and the 
assurance of the fundamental human right to 
be unique as an individual. We confuse leisure 
with idleness, and abundance with waste. We 
view with suspicion the attempt of a human 
being to preserve his privacy and suspect it to 
be an attempt to hide evil. And we almost 
take for granted that an anomalism or eccen- 
tricity is necessarily inferior to conformity. 

Nothing could be further from the truth! 
Idleness, like drudgery, is passive boredom suf- 
fered under duress, and waste is the misuse 
of anything — ^whether it is a scarce commodity 
or something plentiful. Leisure is the joyful 
activity of using our human potentials to the 
fullest, and abundance is intelligent economy, 
namely, the full use of natural resources for 
the good of human beings. Privacy is the fun- 
damental right of civilized human beings and 
a necessity if one is to live harmoniously with 
one’s fellow man. The uniqueness of individ- 
uals has made all human civilization possible; 
for the conformist cannot go forwards and only 
in the individualist’s dreams and the dissent- 
er’s vision today can the reality of tomorrow 
be conceived.^ 

To learn to live in leisure and abundance is 
the task of this generation. Even if we wanted 
to, we would not have the power to choose be- 
tween the past and the future. The cybercul- 
tural revolution cannot be reversed. But we 
can choose the future. We decide what kind of 
world we want to leave for our children ; what 
tve do now determines whether they shall exist 
in idleness or have a chance to live in leisure. 

VII. Early Signs 

Once we have grasped the fact that our pres- 
ent unemployment is only a beginning and that 
there can never again be a time when the labor 
of human beings will be required to produce 
what society wants, we can turn our human 



144 


PR0CEEDINGS-5PRING JOINT COMPUTER CONFERENCE, 1964 


intelligence to the problem of transition — 
namely, to prepare ourselves for the age of 
cyberculture by turning unemplojmient into 
leisure, by solving the transitional problems of 
scarcity, and by doing eversrthing human 
ingenuity can devise to perfect our electronic 
slaves and complete all processes of automation. 

We must rid ourselves of the erroneous 
idea that unemployment is still a negative pe- 
riod of waiting for a change to the positive 
state of being “gainfully” employed again. In 
this country, millions of human beings are in 
a negative state now. Many of them have been 
in this state for many months, years even, and 
many know that they will never be in any 
other state again. All the projections for the 
future — even the most alarming — consider only 
our past experiences. Only very recently have 
a few economists given their attention to the 
phenomenon of acceleration. “For too long they 
misled themselves and the public by projecting 
productivity into the future on the basis of the 
long-term average rate of past productivity 
gains. In so doing, they ignored the fact that 
their averages were a combination of relatively 
low rates in the distant past with significantly 
higher rates in more recent years.”'* 

Computing machines and automation are 
barely in their infancy, and already our world 
has changed beyond all recognition and com- 
parison. If we consider that all change is slow 
until it has overcome initial inertia, we can 
expect, before the end of this century, an in- 
crease in productivity that will dwarf the most 
alarming projections for unemployment. Sol- 
omon Fabricant, director of research of the 
National Bureau of Economic Research, warns 
that “ . . . the long-term pace of advances in 
output per manhour has speeded up. It was 22 
per cent per decade during the quarter-century 
preceding World War I. It has averaged 29 
per cent since. During the most recent period 
— ^after World War II — national product per 
manhour has been rising at an even greater 
rate, 35 to 40 per cent per decade.”® And 
to this should be added what is cautiously noted 
in the President’s Manpower Report : “Al- 
though the statistical data on this subject are 
too limited to warrant definitive conclusions, 
it is probable that underutilization of plant, 
equipment, and manpower resources has had 
significant effect in retarding productivity 


gains since the mid-1950’s.”« Reuther con- 
cludes that “under the stimulus of automation 
and other revolutionary technologies, there can 
be no doubt that the historical tendency for 
productivity to move forward at an accelerat- 
ing pace Avill continue into the foreseeable 
future.”^ 

To the acceleration of technological advance 
we must add — or (more realistically) multiply 
— ^the acceleration in the rate of birth. The 
“war babies” and “post-war babies” will be 
flooding into the labor market — between 25 and 
40 million of them in one decade. No rate of 
economic growth, no method of spreading jobs 
by decreasing the work week or extending vaca- 
tions can absorb the enormous by accelerated 
flood of unemployment. Any dam or deflection 
that worked in the past — forced consumption, 
exploring underdeveloped continents or outer 
space, for example — cannot be used to counter- 
act the potential power — ^for good or ill — of the 
increasing number and perfection of automatic 
systems that can produce 1,000 cars or 10,000 
or 100,000 cars without human intervention 
and with — at most — a few human monitors to 
watch dials and stand by for rare emergencies. 

If we allow human beings to remain unem- 
ployed because machines can do the drudgery 
of repetitive tasks, we are dooming untold mil- 
lions to useless lives without hope and purpose. 
Even if we devise the means to feed them and 
supply them with the output of machines, they 
will not long remain in idleness and scarcity, 
while the products of machines rot in ware- 
houses.® 

VIII. Lessons of History 

Instead of dooming the vast majority of man- 
kind to idleness and unemployment and the in- 
dignity of the dole, we must prepare now for 
leisure and abundance. There are some lessons 
we can learn from history. In the Golden Age 
of Greece we can study a society of leisure and 
abundance based upon wealth that was not 
created by the labor of any of the members of 
the society, but by slaves. 

We piously deplore the evils of obsolete 
slavery and believe it right and proper to con- 
demn millions to starvation, or, at best, the in- 
dignity of the dole. Let us look at Greek society 
honestly and examine how an unsurpassed 
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civilization was created amidst the wealth and 
leisure which, twenty-five centuries later, might 
well have been produced by electronic and me- 
chanical, instead of human, slaves. 

The Greeks differentiated clearly between the 
private life of a human being his life in his 
household which produced the necessities — 
oika, the Greek word for “home,” is the root- 
word of economics — and his life as a citizen, 
which Aristotle called hios politikos. The “good 
life” was the life as a citizen, was “good” 
because man, freed from labor by having 
mastered in his household the necessities of 
life, could pursue human tasks. “At the root of 
Greek political consciousness we find an un- 
equalled clarity and articulateness in drawing 
this distinction. No activity that served only 
the purpose of making a living, of sustaining 
only the life process, was permitted to enter the 
political realm, and this at the grave risk of 
abandoning trade and manufacture to the in- 
dustriousness of slaves and foreigners writes 
Hannah Arendt.”® 

However we may deplore the private, or 
household, life of the Athenian — in this century 
of electronic slaves we can so easily afford to 
condemn human slavery — ^we can only admire 
the unequalled height of civilization his public 
life produced. In his public life every Athenian 
strove to excel, i.e., to distinguish himself from 
all others, to be a unique human being, an indi- 
vidual unlike any other that ever liv«i or ever 
will live. The Athenian lived a human life, in 
play and work, but never in drudgery and 
labor. “Who could achieve well if he labors?” 
asked Pindar . 1 ® 

Several hundred years later and several 
hundred miles to the west of Athens another 
society existed whose citizens were freed from 
the necessity of labor in order to sustain life. 
But whereas freedom from want and the neces- 
sity to labor emancipated the Athenian into a 
human being who achieved excellence, Roman 
citizens became an idle mob under equivalent 
conditions of affluence. The decline and fall of 
the Roman Empire, wrote Edward Gibbon, is 
“the greatest, perhaps, and most awful scene in 
the history of mankind. The various causes and 
progressive effects are connected with many of 
the events most interesting in human annals: 
the artful policy of the Caesars, who long main- 


tained the name and image of a free republic; 
the disorders of military despotism . . .”“ 

The essential difference between Greece and 
Rome is the difference in their points of view, 
in their ethical concepts. The Greeks strove for 
individual execellence; they wanted to create 
beauty and contemplate the mysteries of the 
universe. Abstraction and generalization were 
their inestimable contributions to science. The 
practical they dismissed as not worthy of dis- 
cussion and recording,, Archimedes, whose 
practical inventions covered an astouding 
variety of applications, never thought them 
worthy of description. He wrote only about 
abstract mathematics; we learned from his 
Roman enemies that he invented marvelous 
machinery. 

The death of Archimedes, by the hand of a 
Roman soldier; as the great mathematician 
stood contemplating a diagram he had drawn in 
the sand, is symbolic of the end of an era. The 
Romans were great organizers, “but,” said 
Whitehead, “they were cursed by the sterility 
which waits upon practicality. They were not 
dreamers enough to arrive at new points of 
view.”i2 No Roman ever lost his life because 
he was contemplating abstract mathematics! 

Rome, her unemployed citizens idly seeking 
panem et drcenses, destroyed herself. The 
moral disintegration of Rome had begun long 
before Christ was born. Her conquests brought 
Rome only material luxury and human prover- 
ty. Roman citizens received their dole and 
idled away their humanity in ever more brutal 
titillation. “It was because Rome was already 
dying that Christianity grew so rapidly. Men 
lost faith in the state, not because Christianity 
held them aloof, but because the state defended 
wealth against poverty, fought to capture 
slaves . . . they turned from Caesar preaching 
war to Christ preaching peace, from incredible 
brutality to unprecedented charity, from a life 
without hope and dignity to a faith that con- 
soled their poverty and honored their human- 
ity. . . . The political causes of decay were 
rooted in one fact — ^that increasing despotism 
destroyed the citizen’s civic sense and dried up 
statesmanship at it source. Powerless to ex- 
press his political will except by violence, the 
Roman lost interest in government and became 
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absorbed in his business, his amusements, his 
legion, or his individual salvation.”^® 

We might ask why despotism increased in 
Rome, why the Athenian sought his excellence 
in art and philosophy and science, and the 
Roman in the material luxuries that were all he 
gained from his conquests. We might ask why 
Christianity so very quickly forgot that Christ 
taught human beings to live for the Glory of 
God, which means to live for the joy of living, 
of being human and why the ethos of scarcity 
perverted “living for the Glory of God” into 
laboring “in order to” assure the glory of the 
church. We might ask why the Athenian, 
though conquered and enslaved, mastered even 
his enslavement and his conquerors. We might 
ask whether the Golden Age of Athens could 
have endured if the Athenian had found a way 
out of his dilemma : his need for leisure and his 
rejection of human slavery. 

Returning to our own century of transition, 
we can rejoice that we have what humanity 
never knew before — slaves to free us from the 
necessity of laboring “in order to” sustain life 
that are not human, so that we need not be 
ashamed to enjoy what they produce. For the 
first time in human history, man can be free. 
Machine systems can provide him with leisure 
and abundance, and rescue him from the degra- 
dation of being either a slave or a master of 
another human being. 

But machine systems can do only what man 
wants. If human beings cannot learn to dis- 
tinguish between human tasks and toil fit only 
for machines, if we persist in competing with 
the machine for the repetitive, dreary, stultify- 
ing, de-humanizing jobs for which only ma- 
chines are suited, then humanity wlil become 
enslaved by the machine more cruelly than 
it has ever been enslaved by any despot of the 
past. For the machine provides us with slave 
labor; and, therefore, human beings who com- 
pete with the machine are, thereby, accepting 
the conditions of slave labor. Human beings 
who learn to use the machine wisely, on the 
other hand, will be freed by the machine to 
achieve excellence. 

We are at the cross-roads : one way leads to 
the Athens the Athenians could only dream of ; 


the other to a Rome more dreadful than the 
most ghastly Roman nightmare. 

Greece or Rome — that is the choice we have, 
the choice we must make now, the choice we 
should have made yesterday and for which to- 
morrow will be too late. 

PART TWO— METHODS OF TRANSITION 

I. Educating the Young 

A practical and relatively painless method to 
accomplish the transition into the Age of 
Cyberculture must begin with the education 
of the young. We are well aware of the fact 
that unemployed youth has already become a 
social problem, and we know that what we so 
inadequately call “juvenile delinquency” is not 
restricted to the underprivileged. The violence 
of youth and the crimes committed by children 
show, of course, the general moral decline. 
Even more serious than isolated outbreaks of 
violence, even more desperate than gangs of 
destructive hoodlums, is the widespread indif- 
ference and bewilderment among the young — 
whether they stay in schools that provide noth- 
ing but bland custodial care or whether they 
are drop-outs. 

The real problem of the young is that there 
seems to be no place for them in the world. 
They know society looks with dread upon the 
vast numbers that are pouring out of schools, 
and they know that it is wrong for them to be 
met with fear and loathing. They are the future 
of mankind, and they have a right to be wel- 
comed with joy. 

What would happen if the 25 to 40 million 
young people who will pour out of our inade- 
quate school system in the next decade were not 
to flood an already overflowing labor market, 
but enter instead into a period of basic educa- 
tion for the Age of Cyberculture? 

It would be infinitely harder — ^perhaps im- 
possible — ^to change very profoundly the prej- 
udices of those who have learned to labor and 
who have labored for too long. If their labor 
is taken over by machines, we can only make 
their emancipation which came too late for most 
to enjoy in leisure, as pleasant and comfortable 
as society can afford. And we can try to make 
their idleness not too shameful a thing. 



AN ETHOS FOR THE AGE OF CYBERCULTURE 


147 


But for the young we must do far more 
than train them to become another obsolete 
generation of laborers, for there can be no 
honest labor for them and no dignity in toil. 
Any human being who seeks to labor in com- 
petition with the machine is doomed to slavery 
and to the conditions of slavery. There is no 
human ditchdigger who can live on a scale low 
enough to let him compete with the steamshovel, 
and there is no human bookkeeper, or mathe- 
matician, who can compete with a computing 
machine. 

There is no human printer who can com- 
pete with a tape-fed printing press. And there 
is no human metal cutter who can compete with 
computing-machine controlled machine tools. 
The kejrword is “compete.” We can no longer 
afford to measure the value of a human being 
in the market place. i‘‘ 

In 1955, when the A.F.L. and the C.I.O. con- 
summated their marriage of convenience. Presi- 
dent Meany promised that the newly-weds 
would become parents of an expanding family ; 
they would “organize the unorganized.” But 
with the sole exception of Hoffa’s teamsters — 
who are, at best, considered naughty stepchil- 
dren — the family has not proliferated. The 
auto workers have lost 300,000 members since 
1953 — and this in spite of the fact that the in- 
dustry has achieved a glittering production rec- 
ord in 1963. The steel workers have diminished 
by 250,000. There are half a million fewer 
mine workers, and 760,000 fewer railroad 
workers.'® 

In spite of all the efforts made by labor 
unions to spread the work and to delay the dis- 
missal of workers, it takes considerably fewer 
of us to produce considerably more. Whether 
we regard feather-bedding an evil or a neces- 
sity — it does not prevent the spread of unem- 
ployment. At best it delays the inevitable dis- 
aster for a few; at worst it retards important 
improvements. 

The ranks of labor are diminished by the 
fired, but even more significantly they are 
starved at the source by the vast numbers of 
“the unhired.” Among the ever increasing 
number of the unhired, labor unions must find 
new blood and new strength, and a new lease 
on life. Labor must forget the organization 


methods of the past, when “marginal workers” 
were considered a poor investment. Many old- 
time union men say “they bring not back in 
dues what it costs to organize them,” and are 
thereby guilty not only of greater callousness 
than that for which they blame management, 
but of irresponsible short-sightedness. Unions 
who do this sort of cost accounting while they 
invest their estimated union wealth of $1.5 bil- 
lion in blue-chip securities and profitable real 
estate, are doomed to die of their own corrup- 
tion and decay. 

Labor and management must learn to invest 
in human beings. If labor unions would “organ- 
ize” youngsters who graduate (or drop out) of 
an antiquated school system, their membership 
rolls would swell (and without any crippling 
diminishment of their coffers) and their vigor 
would be restored. They would once again have 
a vital role to play in the society. They would 
once again breathe the fresh air of the future 
instead of suffocating in bank vaults clipping 
their coupons. 

Organized Labor should offer to educate 
every boy and girl who wants an education for 
the Age of Cyberculture. It is to be hoped that 
colleges will, in time, adjust their curricula to 
the needs of the future. With a few exceptions 
our institutions of “higher” learning are cus- 
todial, rather than educational, and perpetuate 
training their unfortunate students for obso- 
lescence. Education must not be equated to 
training for obsolescent jobs. Since our present 
feeble attempts at “re-education” are not edu- 
cational at all and do nothing at all to prepare 
human being to live in leinsure and abundance, 
they could not, and do not, have the slightest 
effect upon our present unemployment prob- 
lem. Such “re-training” efforts are like aspirin ; 
it can disguise a headache for a while, but it 
cannot cure cancer, or even a headache. 

We throw a feather into the Grand Canyon 
and we are surprised that there is no echo ! The 
education that must be provided to get an echo 
from the age of cyberculture must make it pos- 
sible for human beings to learn how to live 
human lives and to create an ethical system 
that will permit human beings to do whatever 
they do gladly and for the sake of the thing 
itself; and not reluctantly and only “in order 
to” make a living. 
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A good curriculum might well start with 
questions about Greek civilization and Roman 
decay. Whatever the specific subject, its aim 
must be to open the eyes of our twentieth-cen- 
tury blind children to the eternal miracle of 
life. 

One labor leader, enlightened about the vital 
need to educate human beings, says : “America 
will be a much better place when everybody 
works four hours a day and attends some kind 
of classes four hours.” He made a start — small, 
but of tremendous significance. His local has 
financed scholarships for children of members. 
More important, it is seeking out latent talent 
among its members (in the hope of developing 
the union’s future leaders) for the “Futurian 
Society.” The best-educated among the mem- 
bers conduct courses for other members. And 
at the local’s Long Island estate, seminars are 
held in such “impractical” sul^jects as literature 
and art and philosophy.'® 

Since the education of several millions of 
youngsters involves far more than a one-week 
seminar, much more is required than a Long 
Island estate and the funds a local can afford. 
Unions are not that rich. But humanity is. 
Such a vast education program must be financed 
by government subsidy, in part. This can be 
justified — even to the satisfaction of the vic- 
tims of the Puritan Ethos — as a perfectly rea- 
sonable investment which, partially at least, 
pays for itself out of savings in unemployment 
compensation, relief, and the costs of custodial 
care for those who would surely commit crimes 
if they cannot find a positive purpose in life. 

Another source of financing should be sup- 
plied by the very machines who have replaced 
human laborers. At least one manufacturer of 
automatic machines is a poineer in this ap- 
proach. He “taxes” every machine he sells by 
putting away a certain sum which contributes 
to the support of a foundation to make various 
studies of several facets of the problems created 
by the very existence of automatic machines. 
“If machines perform our labor, then machines 
will also have to pay our taxes.”''' This may 
be a socially acceptable way of saying that the 
abundance produced by machines must be avail- 
able to human beings, lest it rots away and 
destroys humanity with its fetid decay. 


Union funds, special taxes paid by manu- 
facturers, a contribution made by the entire 
population in the form of government funds 
are three sources to finance the preparation of 
youth for the Age of Cyberculture. Of course, 
it would be foolish to prepare millions of young 
people for a world which, simultaneously, we 
try to postpone. When we no longer need to 
be concerned about new floods in the labor 
market, there is no longer any reason to at- 
tempt any delays in complete automation. We 
are then free to encourage technological inven- 
tiveness and to complete the process as quickly 
as possible, not only in the industrialized na- 
tions but — under the sponsorship and tutelage 
of the United Nations — of the underdeveloped 
countries. 

Such an acceleration of automation would in- 
sure full emplo3nnent for the existing labor 
force, not only in feather-bedding and busy- 
work, but by the full occupation of highly 
trained personnel. It is the only method of 
achieving full employment and full occupation. 
We have neither one nor the other now. 

How many scientists and engineers are still 
employed, but under-occupied? How many pass 
their days in idleness at their well-designed 
desks and in their well-equipped laboratories? 
How many repeat endlessly insignificant ex- 
periments, and waste all their ingenuity in 
inventing more innocuous re-search (not 
research) projects?'® 

II. Experiment in Attrition 

It is highly probable that, without the influx 
of youngsters into the existing labor force, 
normal aging and retirement — ^a process called 
by modern economists and labor experts “attri- 
tion” — will diminish the labor force over the 
next two decades at approximately the same 
rate as machines replace human drudgery. In 
the pioneering agreement, the “Long-Range 
Sharing Plan,” the union and the Kaiser Steel 
Company pointed the way for a company and 
its employers to share the fruits of automation. 
Whatever can be saved by greater productiv- 
ity and efficiently is divided: one third to the 
workers, two thirds to the company (which 
must share its two-third’s portion with the 
government) . 
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Any laborer whose job is eliminated by a ma- 
chine continues to draw his pay for one year. 
During this time he is placed into a labor pool 
which acts as a reserve to fill in for absentees.^® 

It is too early to draw general conclusions 
from the Kaiser experiment, but the officials of 
the steel workers’ union are reasonably pleased 
and the company considers the pool an asset. 
This plan does not provide a complete answer 
to the problems of automation, and it does not 
stop the process of replacing human drudgery 
with machine slaves. It does provide a cushion 
for a few individuals. 

III. Hope For the Future 

If one steel plant can create a labor pool, if 
one manufacturer can tax his machines and 
use the money to encourage study, if one union 
local can provide a place for learning and 
reflection for its members with such remark- 
able results, we have good reason to be hopeful 
for a bright future. 

If we set ourselves as a long-range goal a 
good cybercultural society, we can solve the in- 
termediate problems and devise appropriate 
measures to overcome the immediate difficul- 
ties that are attributable to the phase of tran- 
sition rather than the advances in technology. 
The immediate consequences of diverting the 
young and unskilled from the labor pool and 
into a constructive program of education for 
the age of cyberculture would be dramatic. It 
would, first of all and for all times, wipe out 
the demoralizing condition of hopeless unem- 
ployment. 

We know that in this rich country there is 
considerable and stubborn poverty. Why does 
it exist when granaries are bursting with 
surplus food, when farmers are paid for con- 
senting to let their fields lie fallow, and when 
stores are filled with every sort of consumer 
product so that customers must be enticed to 
want what they do not need? We know that 
poverty, in this country, is poverty in spite 
of abundance — poverty caused by inadequate 
means of distribution, by the ethos, not the real 
existence, of scarcity, i.e., by unemployment. 
To wipe out unemployment and the ethos of 
scarcity is to wipe out poverty. 


This is not true of poverty everywhere on 
earth as yet. There is still real poverty in this 
world, and there are still poor nations, although 
even the poorest nation in the age of cybercul- 
ture is not intrinsically and forever doomed to 
poverty or the charity of others. That is why 
we call the poor nations quite appropriately 
underdeveloped nations. The most enlightened 
domestic policy we might pursue cannot solve 
world-wide problems, and the most intelligent 
and ingenious international agreements can- 
not forever eliminate the danger of nuclear war. 
We must do much more. We must not have un- 
derdeveloped nations. The age of cyberculture 
must be universal. 

IV. World-Wide Cyherculture 

If we eliminate unemployment in this coun- 
try immediately and proceed to create automa- 
tion at the fastest possible rate, we shall almost 
simultaneously free an enormous number of 
highly trained and skilled people — ^the employed 
but under-occupied — who could, under the spon- 
sorship of the United Nations, assist and advise 
the underdeveloped nations to build modem 
automated industries. 

At the present time, most of the newly 
formed nations of Africa seem to be diligently 
creating nineteenth-century conditions of the 
worst sort. It is not surprising, since most of 
their leaders were educated in Europe or the 
United States two or three decades ago so that 
they were imbued with the ideas of nineteenth- 
century Europe and America. 

They dream of leading their countrymen out 
of the jungle, and they are bringing them 
straight to the horror of nineteenth-century 
city slums. It is preposterous to lead human 
beings out of insect-infested green jungles and 
turn them into obsolescent masses of unskilled 
and unwanted laborers in rat-infested city 
jungles. 

Surely among all the remarkable intellects 
that have asserted themselves in the new na- 
tions of the world there must be some who can 
understand not only John Locke and Voltaire 
and Marx, but also Russell’s mathematics and 
logic and philosophy, and Wiener’s cybernetics. 
Surely there must be some among them with 
the imagination not to imitate nineteenth-cen- 
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tury American-European industrialism, but to 
create twenty-first-century cyberculture. Such 
a person, if he also has power in his country 
and influence among his people, can conserve 
the most admirable values of his native culture 
and create, at the same time, good living con- 
ditions for his people. He could bring to his 
country the best our European-American civi- 
lization has to offer without dooming them also 
to the worst. 

If the ethos of the society that sees virtue in 
laboring “in order to” gain something — ^bread 
or status, or self-respect— changes to an ethos 
of abundance, it will be virtuous for human 
beings to live human lives in leisure and abun- 
dance. And no-one can be sure what un- 
dreamed-of heights humanity can reach when 
human talent is no longer wasted in the basic 
struggle to survive. 

It is surely the only sensible and practical 
choice to prefer leisure and abundance to idle- 
ness and waste. The choice must be made — 
and it must be made now. We cannot ignore 
the powerful new phenomena human intelli- 
gence has created. We cannot abdicate our 
responsibility to choose how they shall be used. 
For in our very abdication of choice, we would 
choose the worst alternative: to drift blindly 
toward disaster. 

If we want to conserve our traditional values 
— the right of the individual to life, liberty, and 
the pursuit of happiness— we must choose wise- 
ly and act boldly. 

Our history was made by human beings with 
bold vision and good sense, with deep moral 
convictions and human compassion for human 
frailty, with respect for the dignity of human 
beings and love for mankind, with the imagina- 
tion to dream and the courage to act. Such men 
and women cross oceans, transform continents, 
and build the City of Man ! 

That is our heritage. 

Appendix 

A Program for Transition — Immediately At- 
tainable Goal 

Keeping long-range goals clearly in mind, we 
must make many choices and decisions about 


measures to relieve the problems caused by 
transition. Cybernation affects the unskilled 
earlier than the skilled, it makes certain skills 
obsolete sooner than others. This causes un- 
tenable conditions of abject poverty and a 
sense of personal failure in the midst of great 
affluence and achievement. It takes time to 
erase the prejudices and superstitions accumu- 
lated for centuries. 

The following suggestions to make the transi- 
tion period as constructive as possible and to 
prevent suffering is a goal that is attainable. 
None of the measures proposed will have an ad- 
verse effect upon the long-range goal of creat- 
ing a cybercultural society where human beings 
can live human lives. And, to some extent, all 
measures will contribute to achieving a cyber- 
cultural society as soon as possible and in a 
humane manner that is of benefit to the in- 
dividual. 

I. Government Action 

1. An agency for cyberculture should be set 
up. The agency must have sufficient funds 
and power to study the over-all ramifica- 
tions of new phenomena, to study specific 
recommendations made outside the agency, 
to initiate large-scale experiments, and to 
carry out programs the agency wishes to 
adopt without undue delays imposed by the 
Congress or other governmental and non- 
governmental bodies. The Agency should 
have representative of all branches of gov- 
ernment, of organized labor, of industrial 
management, of academic life, and — most 
important — of the unorganized consumers. 
Representatives of the unorganized con- 
sumers, i.e., the vast majority of citizens, 
should be knowledgeable people who are 
completely distinterested, but intensely 
committed to the goal of a good cyber- 
cultural society with the courage to try 
the untried and the reverence for perma- 
nent values that must be conserved. A test 
laboratory should be set up immediately 
(Long Island would be most suitable) . 

2. Public works: there is an almost inex- 
haustible need for an extensive program of 
public works, such as road building, con- 
struction of dams, irrigation, conversion of 
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sea water, construction of hospitals, 
schools, parks, recreation and holiday 
facilities, beautification of cities, slum 
clearance, low-cost (but comfortable and 
beautiful) housing (about one million units 
per year is perfectly attainable), a reason- 
able power system to provide abundant and 
low-cost power for industry and homes in 
the most remote sections of the country. In 
addition to an extensive public-works pro- 
gram, subsidies to individuals should be 
provided, on a generous scale and in gen- 
erous amounts, to artists, writers and other 
intellectuals, and to artisans and crafts- 
men. This will encourage the most valu- 
able members of a society and make it pos- 
sible for them to devote all their energies 
to their human tasks; creating a great 
civilization with a blossoming of art, 
science, and philosophy, as well as a great 
revival handcrafts. Pride in artistic and 
intellectual achievement and pride in fine 
craftsmanship would benefit not only the 
individuals subsidized but the community 
at large. 

3. Constitutional guarantee of Living Certi- 
ficate. All means of distributing funds to 
those who cannot find jobs should be co- 
ordinated. This would make it possible for 
the most severely (financially) handi- 
capped to move from congested and expen- 
sive areas. Most of all, it would help to 
restore a sense of dignity and worth to the 
unemployed and make it possible for them 
to learn the use of leisure. 

4. Generous Re-location Allowances. Fi- 
nancial assistance for relocation would 
make it possible for families to move to an 
area where (1) jobs might be available, 
(2) educational facilities might be accessi- 
ble, and (3) climatic and housing condi- 
tions might be advantageous. Although 
reasonable precautions to prevent undue 
abuse may be required allowances should 
not be restricted to job opportunities. 

5. Large-scale education program to assist 
the chronically (often for generations) 
under-educated, to develop the potential of 
the gifted by enabling and motivating them 
to continue their schooling for as long as 
they can benefit thereby. Education facil- 


ities should be extended and made practical 
for adults up to any age when there can be 
a reasonable expectation of achievement. 
(Achievement to be defined in terms of a 
personal sense of dignity rather than mate- 
rial success). 

7. Rehabilitation of neglected and deteriorat- 
ing areas: urban and remote on a generous 
scale and with the conscious intent to re- 
store natural beauty and create a^nd/ or re- 
store urban beauty. 

8. Transportation: construction of rapid- 
transit systems, both urban and inter- 
urban, with particular regard for the com- 
fort and convenience of passengers and the 
beauty of the community. 

9. Study of the best use of the licensing power 
of government, and exercise of the best use 
of such power, to carry out the transition 
to a cybercultural society as rapidly as pos- 
sible. 

10. Use of the taxation power of government to 
expedite and ease the transition from 
human labor to automatic machine systems, 
apportioning costs of the transition period 
fairly, motivating industry to cybernate 
and finance the generous application of 
the principle of constitutional Living Certi- 
ficates. 

11. In the process of conversion from a de- 
fense-oriented to a peace-time economy, all 
obsolete military installations and mate- 
riel should be transferred to the communi- 
ty for the best civilian use. Transfer should 
be guided; trained personnel (from the 
military) could participate in the conver- 
sion to civilian use. 

12. A large-scale education and public relations 
program to explain the cybercultural re- 
volution should be devised and carried out 
with the cooperation of non-governmental 
organizations. 

II. Organized Labor 

1. The labor movement should play a signi- 
ficant role in the transition period by rec- 
ognizing that their respoTisibility extends 
to the unemployed, as well as to the (as 
yet) employed. 
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2. Organizing the unorganized including 
young people leaving school and unable to 
find employment. 

3. Vast educational programs to prepare 
union members (and others) for leisure. 
Such programs should be non-utilitarian 
and designed primarily to stimulate people 
to think through the familiar patterns of 
the past and distinguish between obsolete 
conventions and permanent values. A 
realistic program designed to make ‘‘free’' 
time a time of leisure and accomplishment 
rather than idleness would include educa- 
tion for hobbies and, simultaneously, foster 
an awareness that the ethos of scarcity is 
not necessarily virtuous. 

4. Investing the vast welfare and pension 
funds of unions in enterprises of social 
value and thus exerting enormous influence 
upon the shape of our economy and our 
value system. 

5. Cooperating in all programs to expedite 
the transition to a cybercultural society 
and simultaneously entending the sphere 
of influence of organized labor to those 
whose laboring has been taken over by 
machine systems. 

III. Individual Action 

1. In every kind of community, professional, 
special-interest or other types of organiza- 
tions, the individual can spread his insights 
about the cybercultural revolution. Dis- 
cussion with friends, with large groups can 
serve to alert others to the fact that 
changes are taking place, that changes 
are not necessarily to be feared, and that 
we do have choices to make to determine 
the kind of world that is to evolve from 
the changes. For although we can not elmi- 
nate the fact of change, we can determine 
the direction into which change propels 
us. 

2. The thoughtful individual must seek as 
much information as possible, sift informa- 
tion carefully, and articulate what he has 
learned. 

3. The education of the population in a free 
society is the concern and responsibility of 


all citizens. Education can spread by in- 
dividual and group action. 

4. Interest and concern for the action of gov- 
ernment, industry, and labor, is the duty 
of the individual. No government program 
is too complex, no corporation too rich, nor 
labor union too powerful for most careful 
scrutiny by the individual. No injustice is 
too minute, no human suffering too remote, 
no person’s fate too unimportant to affect 
profoundly the structure of the society. 
If the most insignificant right of the least 
important among us is endangered ever 
so slightly every right of every one of us 
is in danger. Only the individual can guard 
the rights of the individual. And the in- 
dividual can and must demand that all ac- 
tion be carried out for the benefit of in- 
dividuals in the society. It is still true that 
a nation has the government and institu- 
tions it deserves. 

5. Individuals, singly or through their or- 
ganizations, must demand generous educa- 
tional facilities and sources of information. 
The media of public information should 
be re-organized to serve the public interest. 
Freedom from commercials on the air and 
from page after page of advertisting ob- 
scuring information in printed material 
should and can be demanded by individuals 
(which does not exclude the use of govern- 
ment licensing power to protect the in- 
terest of the individual.) 

6. Individuals should make conscious efforts 
to examine their own values, discard their 
prejudices, and accept new ideas. ‘‘The 
unexamined life is not worth living,” said 
Socrates. This clearly makes it the re- 
sponsibility of every citizen to examine his 
own life and to assist others (though not 
force it upon them, because that would be 
ineffective) to examine their lives. 
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This brief paper will focus on the tremendous 
potential which now exists for the development 
of a more rational and effective technique for 
dealing with problems of manpower utilization. 
This topic should call for a far more detailed 
and sophisticated treatment than it is about to 
receive from me. For this, I must apologize, 
both to this audience and my own conscience. 
I feel, however, that the opportunity should be 
taken to outline to this particular audience the 
sort of information processing system which I 
feel must be developed if we are to have a more 
effective means for dealing with the growing 
manpower problem in the U.S. 

In discussing the implications of information 
processing for manpower problems, what I am 
really concerned with are the two major prob- 
lems, unemployment and underemployment. At 
the present time, our nation would appear to 
have an embarrassment of riches in both of 
these areas. At the rate we are going, Fm some- 
what pessimistic about our becoming less 
wealthy with respect to these two surpluses. 
Aside from the fact that insufficient levels of 
demand for goods and services necessary to in- 
crease the demand for labor would appear to be 
on the horizon, there is an increasingly difficult 
problem of an inadequate amount of informa- 
tion to guide both potential employers and em- 
ployees in the match-making process we call 
the ‘'labor market."' Before indicating the de- 
ficiencies of our labor market and what the im- 
plications of information processing may be. 


let me briefly describe what the economist sees 
as the functions of the market. 

One of the major assumptions of economic 
theory is that of the “market place" and its 
function of establishing contact between the 
forces of supply and demand. In the market 
place those with a product or service to sell meet 
those who may be interested in the purchase of 
the product or service, and in the bargaining 
process establish the “market price" and con- 
summate the economic transaction. In this way, 
the needs of the consumer and the products of 
the suppliers lead to the pricing and production 
decision policies which ensure that the economy 
is functioning in an optimal manner— at least 
with respect to supplying what the market indi- 
cates is needed. 

Obviously, I have not yet included a factor of 
major importance regarding the function of the 
market — ^that of communication. In the case of 
many products and services, we are all aware of 
the consummate and skillful use of the various 
news media in apprising us of their qualities. 
There is not only very little of a lag between the 
entrance of new products into the market and 
the public information through television, news- 
papers and magazines of these new items — but 
many of us are very frequently informed months 
ahead of the imminent introduction into the 
market of the new, completely revolutionary 
“widget" without which no home, car, person, 
or society can be complete. Indeed, we are 
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even made aware of those types of items which 
are available for the person “who has every- 
thing.” 

But in one all-important sector of our eco- 
nomic life, the paucity of information and the 
turtle-like movement in the direction of the de- 
velopment of a communication system to con- 
nect the forces of supply and demand represents 
what is tantamount to an unpardonable sin. I 
refer to our labor market and the theoretical 
mechanism for matching available and pro- 
jected job needs with the unemployed or those 
whom we can foresee as being unemployed. 

As you undoubtedly know, the unemployment 
situation in the United States has been one of 
increasing concern since the mid-1950’s. In the 
past 10 years, we have not been at a level of un- 
employment which even the most conservative 
economists can view with equanimity. Though 
the present rate of 5.6 per cent is far above the 
3 per cent we usually view as the maximum de- 
sirable level, there are many economists who 
feel that the real rate of unemplo 3 rment is much 
higher. The bulk of this is concentrated among 
the young, the old, and the minorities — chiefly 
the negroes. There is currently raging an argu- 
ment between economists as to the reasons for 
this rate of unemployment, which, in an un- 
precedented manner, continues to hang on like 
a summer cold during the sunny seasons of 
apparent prosperity. Some claim this results 
from structural changes in our economy which 
throw large numbers of people into the category 
of the unemployable, while the other group con- 
tends that the real culprit is too low a level of 
economic growth. For our purposes today, 
which is the culprit, or whether it is both, as I 
believe, has relatively little signiflcance. In 
either situation, there is an increasingly major 
role which computers must begin to play if we 
are to move in a direction of an optimal utiliza- 
tion of manpower. Several factors would lend 
a sense of the imperative to the application of 
a more scientific system for the acquisition, 
storage, retrieval and dissemination of labor 
market information. 

Increasing rates of technological change pro- 
duce a shorter lead time between the potential 
of the innovation and the innovation itself and 
its possible resulting labor displacement. Con- 


sequently, the amount of time in which society 
can anticipate retraining needs and methods 
for increasing worker mobility is shortened se- 
verely. With an ever-increasing labor force, the 
numbers are far from inconsequential. I might 
also add that along with the obvious economic 
facet of the unemployment problem, given the 
age and minority group characteristics of this 
unemployment population, the stimulus for 
dealing with this problem in a more logical man- 
ner would seem to increase rapidly. 

How can computers be of importance in deal- 
ing with this situation? To begin with, com- 
puters and the development of a labor market 
communication system will not create jobs, at 
least not in the numbers necessary to put a 
serious dent in the unemployment population. 
But a communication system can begin to pro- 
vide a much more effective means of establish- 
ing linkages between those possessing specific 
skills and those possessing the jobs for which 
these skills are needed. Those of us who have 
been involved in labor market analysis are more 
and more aware of jobs for which there are no 
takers because the takers are unaware of the 
job openings. Logically, one asks, what about 
the use of such agents of communication as the 
newspapers, private employment agencies and 
the United States Employment Service. In ex- 
amining these major possibilities, several “nec- 
essaries” for an optimal solution should be kept 
in mind. To begin with, the description of the 
available job and the matching skill must be 
fairly detailed ; second, there must be a formal- 
ized network which brings together the parties 
most interested logically in each other’s poten- 
tial ; third, there must be the maximum cover- 
age by industry and geographical area, of all 
job and skill-matching potential; and fourth, 
there must be a minimal delay in sending the 
message to the logical parties, both on the sup- 
ply, or employee, side and the demand, or em- 
ployer side. 

Now what is the situation with respect to 
newspapers, private employment agencies, and 
the U.S. Employment Service? If we begin with 
newspapers, I think we immediately see that on 
the employer’s side, there is a real opportunity 
for advertising in ample description. With re- 
gard to the unemployed individual, however, it 
is difficult to envision a situation where the 
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individuals who have been laid off, many of 
whom are in the unskilled or semi-skilled cate- 
gories, will be in a position to advertise. This is 
so, firstly, because of the financial problem. 
With very limited financial means it is impos- 
sible for an unemployed individual to be able to 
advertise in sufficient length so that an adequate 
background description can be given. Aside 
from this, many of the items and characteristics 
which we have mentioned above could not pos- 
sibly be contained in the advertisement which 
an unemployed individual might conceivably 
place in a newspaper. Secondly, the newspaper 
is hardly a formalized network which can bring 
together the parties most interested logically in 
the potential of each other ; that is, the potential 
employer and the unemployed individual who 
may have the characteristics being sought by 
the employer. It is a very informal source of 
information with no compulsory or systematic 
feature about it. As a matter of fact, in a study 
done in 1962 by Eva Mueller and Jay Schmied- 
eskamp, supported by funds from the Upjohn 
Institute, it was found that only between 5 and 
10 per cent of the unemployed who were able 
to locate new jobs had been able to locate the 
jobs as a result of their use of newspaper ad- 
vertisements. The newspaper very seriously 
lacks the potential of a network which can be 
institutionalized and which can continuously 
and in an exhaustive manner bring together all 
of the sources of information which we need for 
the network system we must begin to envision. 
Since it is also completely voluntary, there is no 
real opportunity for an extensive coverage 
either by industry or by geographical area. 

If we look at the private employment agen- 
cies, we immediately see that there is the major 
hurdle for many of the unemployed of a sub- 
stantial fee or part of their salary which 
would be deducted upon placement. Private em- 
ployment agencies also lack the tremendous 
scope of coverage in terms of industry and geo- 
graphical areas which we have set forth as one 
of the major criteria for a successful system. 
One of the objectives which the system we are 
beginning to formulate should have is that 
of being capable of directing the unemployed to 
positions available outside of the immediate 
areas in which they reside. Another problem 
of the private employment agency is that very 


frequently they tend to choose those sorts of 
individuals whose skills represent a higher 
probability of placement rather than a lower 
probability of placement. The very practical 
reason for this has to do with the high cost of 
continuing to attempt to place individuals 
whose skills are such that they represent a 
higher cost rather than a lower cost problem 
with respect to placement. The job placement 
success of employment agencies can be com- 
mented upon quickly by alluding to the Mueller- 
Schiedeskamp study, where it was found that 
only between 7 and 14 per cent of those unem- 
ployed who finally found work found the job 
as a result of the use of an employment agency, 
public or private. 

Finally, let us look at the United States Em- 
ployment Service. The job description informa- 
tion and the matching skill information devel- 
oped by the U.S.E.S. may or may not be fairly 
detailed, depending on the efficiency or the em- 
ployment service officer or interviewing per- 
sonnel in the individual state agencies which 
make up the U.S. Employment Service. Like 
private employment agencies, between the 
states we have an erratic behavior pattern with 
respect to the adequacy of the job description 
information. However, the U.S. Employment 
Service does have the skeleton of a formalized 
network which can bring together parties most 
interested in each other’s potential. This is so 
because it is organized with a central focus 
coming from the Federal Government with em- 
ployment services located in and controlled by 
the individual states. This is a very important 
potential with respect to our envisioned com- 
munication system. By virtue of this distribu- 
tion of officers, there is the potential coverage 
of each of the industries and geographical areas 
in the United States. However, the U.S. Em- 
ployment Service, like the other sources of in- 
formation, lacks in the ability to have a minimal 
delay in sending messages to the logical parties 
both on the supply — or employee — side and on 
the demand — or employer — ^side. Of great im- 
portance, however, is the fact that within the 
present unemployment placement services of 
the U.S. Employment Services, there is the seed 
from which can grow a major automated com- 
puter operations system which is calculated to 
provide the basis for matching information on 
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available jobs with available individuals for 
these jobs. Under the original Wagner-Peyser 
Act, which authorized the establishment of the 
U.S. Employment Service, there was author- 
ized a system of job-clearances between the 
states, and during World War II this activity 
was of tremendous importance in achieving a 
greater degree of mobility of the labor force 
and moving scarce skills to defense industries. 
This activity has continued as part of the em- 
ployment service and has been recently im- 
proved. Its most effective recent operation has 
to do with the professional office network. This 
particular plan deals with making potential 
jobs available to professional people and poten- 
tial workers available for the consideration of 
employers needing their services on a nation- 
wide basis. This program was begun in March 
of 1956 after a number of pilot projects and 
experiments in 8 states and the District of 
Columbia and Puerto Rico. At the outset, the 
plan was to provide for a flow of unfilled orders 
and unmatched applications for the local office 
to a central “key city” office in each one of the 
individual states concerned with attempting a 
state-wide effort to find openings or recruit 
applicants. Beyond that there was an exchange 
of still unmatched orders and applications 
among the various key cities which were de- 
fined. This was expanded so that by 1963 there 
were 121 professional network offices located 
in key parts of all sections of the United States. 
This precedence could perform an important 
function in that it begins to indicate a proto- 
type on the basis of which a larger automated 
massive acquisition, storage, retrieval and dis- 
semination system could be developed. As I 
would envision the system toward which we 
hope to move, every individual unemployed in 
the United States who registers either for a 
job at the U.S. Employment Service or at a 
private employment service, or for unemploy- 
ment insurance would have information filed on 
the nature of his particular skill, educational 
background, work experience, age, and other 
social and economic characteristics of impor- 
tance for job location. Such information would 
be coded in terms of a number, such as the 
Social Security number, so that in the event 
that such an individual was able to obtain a 
job for which he was qualified, that individual’s 


availability for a job would be dropped from 
the storage system. In addition, all graduates 
of high schools, whether or not they were seek- 
ing jobs, would also be listed in the same man- 
ner on the assumption that they might possess 
skills for which jobs would be available in the 
United States. 

On the employer side, all job vacancies would 
have to be reported to the U.S. Employment 
Service indicating the expected duration of the 
vacancy, the nature of the skills indicated as 
necessary to fill the vacancy, and other social 
an economic characteristics of the individual 
for whom the vacancy might he a possible 
source of work. In addition, all expected va- 
cancies within a 12-month period would have 
to be listed by employers, thus providing an 
early-warning system of expected unemplo 
ment in a specific industry and community. 
This information would be gathered at each of 
the local U.S.E.S. offices in each of the states. 
Each state would then have a central collection 
agency which would then put all of this infor- 
mation, coded properly for identification pur- 
poses, on tapes from which the information 
would be brought together in a central skill- 
job-locator system. This locator system would 
quickly match jobs available — no matter where 
the job may be found in terms of geographical 
area and industry — with the skills which have 
been posted for each of the individuals seeking 
work. The matches which fall out of compari- 
sons of skills and job openings would then pro- 
vide a means for communication without delay 
to the concerned individuals. Communication 
would take place simultaneously, as a matter of 
fact, in order to permit both the employer and 
the prospective employee to make contact with 
each other. Depending on the scarcity of the 
skill and the proximity of the employer to the 
employee, there might be indicated the pro- 
vision of mobility funds, either from industry 
or from the government. But in any case, at 
least there will have been achieved a matching 
of a job opening and the individual who is seek- 
ing that particular sort of job, or for which his 
skills provide him with a basis for doing the 
job. This second point is important because 
more and more we have begun to recognize the 
need for transfer from one particular occupa- 
tional skill to another. In some cases, the skill 
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is such that we can move quite easily between 
industries ; hence it"s important that the infor- 
mation which is stored is of such a fundamental 
or generic nature that the language of the job 
description itself does not fall into folkways 
or traditional terms which might limit place- 
ment on the basis of a restrictive job descrip- 
tion ‘‘cliche’' rather than on the basis of skill 
descriptors which may qualify an individual 
for a number of jobs with completely different 
titles. 

What I have discussed thus far would be an 
ambitious program to get under way immedi- 
ately on a national level. However, a great deal 
can be done at the local level. By local level, I 
would begin within the confines of a major 
metropolitan area rather than on a larger re- 
gional basis. By starting at the local level, 
there are several assets. To begin with, the 
local U.S. employment services does, to some 
degree, although a limited one, know the local 
labor market. It does avail itself of a certain 
limited number of information inputs from em- 
ployers with regard to contemplated changes in 
the labor force. It also, to some degree, makes 
use of what information is available on the 
nature of forthcoming graduates from voca- 
tional education programs and the nature of the 
skills which they may possess. Building on this 
rather primitive basis, we can begin to design 
at the local level a more formalized system of 
information acquisition, storage, retrieval and 
dissemination with respect to labor market 
structure and needs, both on the supply and 
demand sides. An additional asset, if we start 
at the local level, results from the fact that 
co-operation between employment services of 
the individual states varies rather widely and 
in most instances we find that the individual 
U.S. employment services within a state act on 
an autonomous basis. In some instances there 
is a bonus which results from the fact that some 
school systems have already begun to develop 
formal and standardized procedures for storing 
information on graduates. This is done with 
the objective of following up on graduates after 
their departure from the school system in order 
to determine what the nature of their work in 
the job world was after leaving school and the 
degree to which the training, especially voca- 
tional, affected their success in finding work as 


well as the nature of thp work which they found. 
These information systems can be taken over 
and utilized for some of the initial information 
inputs which will be necessary on graduates as 
well as non-graduates coming from the school 
system and entering into the labor force. 

In addition to the use of computer systems 
for providing a means of communication be- 
tween the demand and supply sides of the labor 
market, there is also a fascinating opportunity 
to utilize the computers as a current analytical 
tool for determining the relationships between 
various economic, social, and educational char- 
acteristics of the employees or unemployed labor 
force in the population and the degree to which 
these characteristics affect mobility, job-seek- 
ing patterns, ease of placement, labor turnover, 
and other factors which are of prime considera- 
tion in the development of manpower policies. 
At the present time, we know very little con- 
cerning these sorts of factors and the degree 
to which there are correlations between these 
factors and other behavioral characteristics and 
placement possibilities of the unemployed as 
well as the employed in the labor force. By 
use of computers, we may also be in a far better 
position to determine the degree to which indi- 
viduals with a broad array of skills may be 
underemployed in our economy. Once having 
established the basis for gathering information 
and communicating thjs information on individ- 
uals seeking jobs as well as employers seeking 
skilled individuals for jobs which are available; 
there is the potential for continuing to gather 
information w^hich can be coded on the basis of 
Social Security numbers with regard to types 
of skill, hours of work, units of output or pro- 
ductivity, and other such economic and social 
characteristics while the individual is actually 
employed. This information can be collected, 
collated and analyzed on a current basis and, 
with such information being available, there is 
the possibility of our being able to move away 
from the rather limited sample which we now 
use to measure unemployment in the United 
States. Further, it presents us with the in- 
triguing possibility of being able to put a tracer 
on individuals who have been displaced for va- 
rious reasons from an industry, the degree to 
which they find new employment, the nature of 
the new employment which they find and the 
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levels of wages which these individuals are able 
to obtain in these new forms of employment. 
This is particularly intriguing because, at the 
present time, the major argument raging be- 
tween economists concerning whether or not 
unemployment is due to structural factors, in- 
cluding technological unemployment, or too low 
a level of aggregate demand has been subjected 
to no truly rigorous research treatment. Each 
side of the argument obtains major support 
from deduction, logic, ^nd scanty data rather 
than from large-scale analytical research pro- 
cedures involving survey techniques and di- 
rectly relevant primary data. Before closing, 
I would like to suggest one additional fascinat- 
ing potential application of computer technol- 
ogy to a difficult manpower problem — >that of 
designing more effective training systems. 
Among these systems I would include both vo- 
vational training at the high school level and 
adult re-training at the post high school level. 
If we regard this sort of training as a schedul- 


ing of a mix of sequential, overlapping and 
concurrent phases, where the optimal situation 
is one where we can forecast time “bottlenecks” 
and choose options which shorten or eliminate 
these bottlenecks, we are really considering the 
application of P E R T and critical path method 
techniques for training and curricula design 
problems. The scheduling of education, very 
much like the scheduling of a Polaris project, 
calls for the same awareness of the trade-off 
potential between time and money and the de- 
velopment of P E R T network or critical paths 
which permit the design of a system more con- 
cerned with producing an end product in less 
time rather than being concerned with a mar- 
ginal savings of funds. This is not only of sig- 
nificance with respect to scientists and engi- 
neers but also with regard to potential juvenile 
delinquents and heads of families who, without 
work or meaningful job roles, represent serious 
personal and social costs with which we are 
already becoming familiar. 
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Many of us are gradually becoming aware of 
the fact that an enormous number of Americans 
live in abject poverty. While estimates of their 
number vary, it is certain that millions of people 
in this country live in chronic need, perhaps as 
many as one out of every four of five citizens.^ 
Children, the sick, and the aged constitute a 
large part of this number ; but also included 
among them are approximately four million un- 
employed Americans on whom millions of addi- 
tional citizens once depended for their support. 
Many of these people are victims of automation 
and the computer revolution. John Snyder, 
President of U.S. Industries, has estimated that 
automation is a major factor in displacing 

40.000 workers per week. The Department of 
Labor, which is more conservative, estimates 
the rate of displacement through automation at 

4.000 per week. 

But poverty is not the only threat to those 
who have been displaced by automation. Even 
if the unemployed are provided an adequate 
standard of living, millions will still be threat- 
ened by psychological problems which have their 
roots in two conditions of the contemporary 
American scene. First, most unemployed people 
hold values based on the Protestant ethic — 
values which are ill-suited to a world in which 
there is not enough work to go around. And 
second, there is the machine itself, particularly 
the computer, which is presented to the worker 
as being faster, more accurate, more reliable, 
and in short, better than he is. In this essay I 
would like to focus not on the physical plight of 


these people but on the psychological problems 
they face. 

Underlying many of the psychological prob- 
lems which the unemployed face in trying to 
adjust to their condition is the fact that the 
prevailing value system in this society places 
great stress on the virtue of work. Although 
many observers have commented on this, one of 
the best known was Max Weber who called it 
the Protestant ethic.^ The central notion in 
the Protestant ethic is the idea that labor is 
noble whereas idleness is im_m_oraL The fact 
that Weber, a German, chose an American, 
Benjamin Franklin, as the exemplar of the 
Protestant ethic emphasizes the degree to which 
this country is uniquely identified with it. 

The aphorisms of Franklin are rich in their 
reference to the m_oral value of a man’s labor. 
Franklin, at one point in his career, set out to 
distill the essence of moral perfection into 13 
virtues, one of which was industry,^ “Lose 
no time,” he advises. “Be always employed in 
something useful; cut off all unnecessary ac- 
tion.” 

In Poor Richard’s Almanac, Franklin makes 
these additional observations on work and idle- 
ness ; 

“At the working man’s house hunger looks 
in, but dares not enter.” 

“Plough deep, while sluggards sleep, and 
you shall have corn to sell and keep.” 
“Trouble springs from idleness.” 
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No twentieth century American is entirely 
free of his cultural ties to Benjamin Franklin 
or the Protestant ethic. Despite occasional jokes 
to the contrary, most of us feel uneasy when we 
are away from the daily working routine too 
long. Indeed, as our jobs become more abstract 
and it becomes more difficult to point to concrete 
products resulting from our most intensive 
labors, many of us are plagued with doubts. 
Are we in fact needed? Are we paying our 
way? Or are we perpetrating a gigantic hoax 
on a world too confused to recognize what it 
does, in fact, need ? 

Our uneasiness is rooted in the fact that work 
is the principal avenue by which Protestant man 
fulfills his potential as a unique and creative 
human being. Although our daily life is filled 
with numerous activities, work and sleep con- 
sume the largest part of our days; and in no 
other sphere is the potential for self-realization 
so great as work. To question my work is to 
threaten my value as an individual ; to deprive 
me of work is to take from me the opportunity 
to give meaning to my life — the opportunity to 
achieve and to be recognized by my fellow 
human beings. The important point to note here 
is this: a man’s labor does more than supply 
the necessities of life; in this culture, it also 
feeds hidden and only dimly understood psychic 
needs as well. Though a man may have more 
money than he requires to satisfy his own needs 
and those of his family, he still feels impelled 
to work. Work is an autonomous activity : it is 
good in and of itself, having a value surpassing 
the immediate ends it serves. ‘‘God gives all 
things to industry,” says Poor Richard. To not 
work, on the other hand, is bad. “Be ashamed 
to catch yourself idle,” as Poor Dick says. 

Though it may be otherwise on some far off 
Pacific Island, in this culture the average man 
who does not work feels guilty and useless, and 
this, unfortunately, is true even though he is 
unable to work through no fault of his own — 
because he is too sick or too old or too young or 
too poorly trained. 

While the unemplo 5 rment statistics do not tell 
the whole story, they suggest that there is a 
great deal of psychological, as well as physical, 
misery in this country.^^ The plight of such 
“special” manpower groups as the young, the 


old, and the minorities is particularly tragic. 
Unemployment rates for teenagers are three 
times the national average and have risen 
almost 70% in the past five years. Older 
workers — those over 45 years old — are similarly 
disadvantaged, finding it difficult to find a new 
job once they lose their existing one. They ac- 
count for a disproportionate percentage of the 
long-term unemployed, and many older workers 
“retire” prematurely to avoid the psychological 
stress of hunting for non-existent jobs. 

Minority groups are hardest hit. Unemploy- 
ment rates for non-whites are in almost all cases 
twice the rate for whites — regardless of educa- 
tional level, age, sex or skill. This means that 
non-white unemployment rates for teenagers 
amount to about one potential worker out of 
three ; and in some areas, only one youth out of 
two has found a job. 

One could go on citing such statistics as these 
endlessly, but that is really unnecessary. These 
data point up the central fact that millions of 
people are caught in a fearful conflict. They are 
unable to find jobs in a culture where the pre- 
vailing ethic is “work or be damned.” 

These statistics appear to have had very little 
impact on the drive to replace men with ma- 
chines. Indeed, the moral imperatives which 
guide the computer revolution seem to take for 
granted that man will somehow bungle through 
if the machine is properly tended. Writing in 
Harper's in 1951,^ two Canadian physicists, 
E. W. Leaver and J. J. Brown, suggest that 
widespread automation will bring an era of 
peace and creative human development. Their 
basic principles, which will presumably bring 
about this new era of peace and prosperity, are 
notable for their emphasis on the machine, 
rather than on man himself, as the source of 
human salvation. 

1. Machines should replace men wherever 
possible. 

2. Men should not be used for routine opera- 
tions if machines can do the work. 

3. Automaticity of machines should be en- 
couraged- 

4. Automaticity of men should be dis- 
couraged. 

5. Men must be ancillary to machines. 
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To a generation which has grappled with suCh 
earth-shaking problems as the control of nuclear 
weapons, the belief that machines should replace 
men or that men should be ancillary to machines 
may seem trivial. But in point of fact, it may 
be as revolutionary a change in the beliefs of 
man about himself as the shift which occurred 
with the widespread acceptance of the heliocen- 
tric rather than geocentric view of the universe. 

Before the general acceptance of the heliocen- 
tric view of the solar system, most people saw 
a special significance in the fact that the sun 
and stars apparently circled the earth ; it seemed 
to imply that man, who was obviously first on 
earth, must therefore enjoy a special relation- 
ship to the entire universe. But even then, men 
could still look around them and find in their 
own handiwork something special, something 
which set them apart from other living things. 
Veblen, in fact, considered a sense of workman- 
ship chief among the instinctive dispositions of 
man."^ The new doubts which the machine 
has cast on the capabilities of man are sufficient 
to shake his faith in himself to the very roots. 

In the past, as machines extended human 
capabilities, they freed m_en from_ much of the 
drudgery that characterizes more primitive so- 
cieties. Even today in many underdeveloped 
societies, virtually every member must devote 
all of his waking hours to the finding and prep- 
aration of food. There is not enough excess 
capacity in these societies to free more than a 
few individuals for other kinds of activities. 
Under such conditions, machines make good 
economic sense. While the rationale behind the 
quest for newer and better machines is still 
largely economic, we seldom ask what we are 
“freeing*’ men from and what we are “freeing” 
them for. 

Underlying the relatively high value which 
we place on the machine is the notion that men 
really aren’t very efficient mechanisms. Ma- 
chines can do many things better than men, and 
some would assert that machines can do most 
things better than men. We are told that they 
are faster; that they are more accurate; that 
they are more dependable ; and there are those 
who believe that computers rather than men 
should really make many decisions. 


It would be well to note that it makes very 
little difference, for the purpose of this discus- 
sion, whether the machine is or is not superior 
to man. In many ways the issue is academic. 
What really matters is the extent to which the 
average person is corivinced that the machine 
will replace him because it is better ^ than he is. 
With the enormous prestige of science standing 
behind us, I submit that we have unwittingly 
convinced the average man (who may not in 
any case hold himself in very high esteem) that 
he is in fact not really worth very much. This, 
I fear, is a most serious mistake, first because 
of the impact it will ultimately have on our cul- 
ture, and second, because it is not true ! 

Clearly the proposition that men should be 
replaced by machines because they are better 
than he is inconsistent with the Protestant ethic. 
If a man must work to feel worthy and needed 
and we persist in eliminating him, then the 
result must inevitably be a profound and pos- 
sibly disastrous change in the fabric of our cul- 
ture. While no one can predict precisely the 
social consequences of this state of affairs, most 
social scientists to whom I have talked are not 
optimistic. 

To the sociologist, the disparity between the 
prevailing Protestant ethic and the realities of 
modern life as experienced by many unemployed 
people leads to an alienation between the in- 
dividual and his reference group. Emile Durk- 
heim ^ in his comprehensive study of suicide 
used the term “anomie” to describe the condi- 
tion in which an individual is constantly forced 
to compromise his established values when they 
conflict with the existing reality — a condition 
which inevitably results in a feeling of separa- 
tion and rootlessness. 

From the viewpoint of the psychologist, most 
problems are rooted in conflict and frustration. 
When the society places inconsistent demands 
on the individual, insisting, for example, that 
he both work and accept the fact that jobs are 
not available, he struggles to resolve the incon- 
sistency. In the absence of more effective re- 
sponse alternatives, the individual can be 
expected to invoke certain well established psy- 
chological mechanisms in order to adjust to 
continuing conflict and frustration. Aggression, 
regression, fixation, and in the face of prolonged 
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frustration, apathy, withdrawal and resigna- 
tion, are mechanisms used to some extent by 
normal individuals. But when tension persists 
despite all efforts to remove it, and the individ- 
ual resorts to inappropriate psychological mech- 
anisms as a “way of life,^' the consequences 
are potentially very serious for his mental 
health. 

The clash between the Protestant ethic and 
automation threatens to place literally tens of 
millions of Americans into what is essentially 
an irresolvable conflict situation. In the absence 
of socially acceptable alternatives, the most 
likely result of this is the relatively permanent 
adoption of one or more inappropriate psycho- 
logical mechanisms. This is basically an un- 
healthy state of affairs. 

Many advocates of the machine-over-man po- 
sition declare that we wil] automatically adopt 
healthy alternatives — that we are entering a 
new era where the workman will turn to culture 
and the arts, thereby resolving the conflict in a 
socially acceptable and useful way. This seems 
unlikely. 

According to statistics published by the De- 
partment of Labor, the following educa- 
tional levels can be expected to prevail in the 
1960's: 

For youths, if current trends continue, ap- 
proximately 30 per cent of those entering the 
labor market (7^4 million young people) will 
not complete high school. Approximately 2^4 
million of these will not even enter high school. 

For non-white workers, the situation is even 
worse. As of March 1962, one out of every three 
non-white workers had not completed an ele- 
mentary education, and only one in five had 
completed high school. 

These educationally deprived groups, who will 
be the hard-core unemployed of tomorrow, do 
not appear to be particularly promising candi- 
dates to lead America in a new cultural ren- 
aissance — nor do the masses of older employed 
Americans who suddenly find themselves dis- 
placed by machines. 

In Oklahoma City, in 1960, an intensive effort 
was made to help displaced workers after the 
closing of the Armour plant. One hundred and 


seventy people were tested, but only 60 showed 
promise for retraining. As a matter of fact, out 
of a total of 431 workers invited to be inter- 
viewed, only 143 men and 27 women completed 
both tests and interviews.^® 

In another recently published study,!^ the 
Labor Department compared the characteris- 
tics of all unemployed workers with those of 
30,650 trainees enrolled in Federally financed 
retraining programs. Older workers are poorly 
represented in such programs. Only 5.7 per cent 
of trainees are over 45 years of age, and yet, 28 
per cent of the unemployed fall into this age 
group. From the report, it is not clear why these 
people are so poorly represented, but one sus- 
pects that they are simply not prepared to build 
new lives for themselves through retraining. 
Whenever they are called upon to change their 
ways dramatically, older people must overcome 
a lifetime of learning and powerful habits built 
up through years of conditioning. The report 
notes that many older people are undereducated 
and unskilled and have little to offer the econ- 
omy. It is reasonable to suppose that many of 
these people are undereducated because even in 
youth they were not highly motivated to learn, 
and that many of them simply have no wish to 
learn now. On the other hand, prodded by the 
Protestant ethic, they do have the desire to play 
a useful role in this society. It seems completely 
naive to believe, as some apparently do, that 
this need can be satisfied by social activities 
which demand a radical reorientation of their 
existing modes of behavior. 

The new society will provide the potential 
for additional leisure, but a large proportion 
of the population is unfortunately ill-prepared 
for leisure. They are ill-prepared, first, because 
leisure is antithetical to the Protestant ethic, 
and second, because the enjoyment of leisure 
requires preparation for it. Relaxation may 
come naturally to us as children. But the ability 
withers under the regimentation of a civilized 
society, as evidenced by the emotional and 
psychological preparation required by most men 
who have worked all their lives and who are 
suddenly faced with the full-time leisure of 
retirement. 

Will the unemployed voluntarily turn to edu- 
cation in search of a solution to their conflicts ? 
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While I am not" aware of any research directed 
at answering this specific question, a recent 
study ^2 by the National Opinion Research 
Center provides some interesting information 
about the kinds of people currently participat- 
ing in adult education programs. Although one 
out of five adults follows some plan for leisure- 
time education, the ‘‘typical’’ participant is 
apparently not one of the “other Americans.” 
Indeed, the “typical” participant is described 
by the authors as a man or woman who has com- 
pleted high school, has an above-average income, 
a full-time white collar job, and is a white mar- 
ried Protestant living in a city or suburb. 

What very few people seem to fully under- 
stand is this: large numbers of unemployed 
people lack the most elementary academic skills, 
such as the ability to read well. In view of the 
fact that millions are insufficiently trained to 
find work, or even warrant retraining, and that 
many more are past their prime, fixed in their 
ways, and poorly adapted to change of any kind, 
it seems unlikely that they will turn to cultural 
interests. 

Nor do these remarks apply solely to the 
laborer. It would be foolish to believe that the 
computer revolution will stop with the man in 
the denim shirt. On the contrary, white-collar 
automation is at least as great a threat as blue- 
collar automation. Today, white-collar workers 
being replaced are those who are engaged in 
basically routine tasks, but if the past is any 
guide to the future, the “clip level” will gradu- 
ally rise to encompass literally millions of addi- 
tional white-collar workers. In the crudest 
terms, computers are installed because they 
make economic sense, and they make the most 
economic sense where they eliminate high-cost 
manpower. Managers aren’t installing comput- 
ers for the fun of it; they are installing them 
because they save money, largely by doing tasks 
which would otherwise be performed by 
humans. To review each of the areas in which 
this is true would be both tedious and unneces- 
sary. Airlines, banks, brokerage houses, manu- 
facturing firms — ^virtually all sectors of the 
American economy — ^are discovering the simple 
fact that computers cut labor costs. 

Whether white-collar workers will turn, en 
masse, to the classics, art, music, or even do-it- 


yourself projects, is not easy to foresee. But 
even if millions do turn to cultural pursuits, it 
is safe to predict that millions will not. What 
of them? Like their friends in denim shirts, 
they too are products of a Protestant ethic. 

To summarize, millions of unemployed Amer- 
icans, and millions more who may soon be dis- 
placed by automation, are psychologically 
threatened from two directions at once. First, 
there is the threat created by the fact that they 
hold a value system — ^the Protestant ethic — 
which is contradictory and ill-suited to the 
world in which they find themselves. And sec- 
ond, there is the threat presented by the ma- 
chine itself — ^the computer — which makes the 
individual feel insignificant and inferior by 
comparison.* 

What are we to do about the psychological 
problems I have described ? 

I have shown that the continuing process of 
automation challenges one of the most funda- 
mental factors in the psychological makeup of 
the American individual. While we are proba- 
bly unwilling and even unable to stop automa- 
tion, it is within our power to change the nature 

* Although I am convinced that what matters here 
is not whether machines are really superior to men but 
what people believe to be true, I cannot leave this topic 
without making one or two comments. Many authorities 
appear to feel that machines will some day simulate all 
of the important properties of human intelligence. As 
a psychologist, I am frankly a skeptic. 

In a recent article published in Datamation, i Paul 
Armer deplores the fact that we keep redefining intel- 
ligence so that it is always just out of the reach of the 
machine. The fact of the matter is ‘intelligence” and 
“thinking” have always been concepts just out of the 
reach of scientific psychology, as well. We don’t really 
know how to define them in an entirely satisfactory 
way. Indeed, that is the problem. I’m sure that if these 
terms are ever defined in a way which encompasses the 
full richness of the phenomenon to which they generally 
refer, then it will be possible to simulate them. 

But, to be satisfactory, the definition will have to 
include some extremely frustrating aspects of human 
behavior. For example, any really adequate definition 
of thinking will have to recognize the fact that percep- 
tions are selective and shaped by past experience; that 
the mind wanders down strange and wonderful unpro- 
grammed paths; and that the most creative thoughts 
are sometimes elicited by the most illogical and bizarre 
associations. In short, the very features of man which 
make him slow and unreliable may be his most valuable 
properties. 
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of automation by making it more sensitive to 
human variables. To the extent that our anal- 
yses of the benefits of automation ignore human 
costs, they are incomplete. There are costs asso- 
ciated with retraining, with the support of the 
unemployed, with social dislocation, and with 
the psychological misery of the displaced 
worker. In general, when the benefits of auto- 
mation are calculated, these costs are ignored. 
Taxation may spread such costs over a broad 
base, but they are still enormous — and they will 
continue to grow as automation expands. It is 
important that computer scientists become 
sensitive to these human and social costs so that 
systems analyses include all of the relevant 
variables. 

Second, although the task will take decades, 
we must begin to change the Protestant ethic. 
Changing the Protestant ethic will involve a 
dramatic reorientation of our society. Most 
important, we will have to learn to live with 
the fact that the day is not far off when com- 
puters and improved machines will make it im- 
possible for ever-increasing numbers of Ameri- 
cans to fulfill their potential through work. 
Many of our children can expect to spend a 
large part of their lives in leisure. To this end, 
public education should modify its curricula to 
prepare our children and grandchildren not only 
for work but for the more profitable use of free 
time as well. Unlike many who see in every 
subject which is not blessed by Admiral Rick- 
over a direct threat to the American way of life, 
I believe we should begin to emphasize in our 
schools the constructive use of leisure by en- 
couraging our young people to participate in 
the full range of life's activities, including art, 
music, drama and similar fields which today are 
seen by many to be frills. Also, the society 
should recognize and reward excellence in these 
fringe areas as much as it does in more tradi- 
tional fields. While I do not believe we should 
turn our backs on science, we should not allow 
it to consume us either. 

Third, we must re-examine the economic base 
of our society. Up to the present time, we have 
forced man to work if he wished to receive an 
income. If we can no longer provide work in the 
traditional sense for everybody, we must recon- 
sider how the unemployed are to be provided 
with resources and what volume of resources 


they should receive. Already, some writers have 
suggested that every individual will have to be 
given an absolute right to an income adequate 
to live his life in dignity.® 

Fourth, there is the requirement that we edu- 
cate every citizen to the limit of his abilities. 
To the extent that people are needed at all in 
the emerging, new society, they must be edu- 
cated. Most of those who are not educated will 
be cast ruthlessly aside. Furthermore, educa- 
tion is a profitable way to consume time which 
might otherwise be spent in antisocial or so- 
cially maladaptive ways. Therefore, the right 
to an education is an essential guarantee — not 
only for the protection of the individual but for 
the protection of society as well. 

Finally, we need more information about the 
problems of the unemployed. We are in need 
of better psychological data about the displaced 
worker, his attitudes toward society, the way 
he spends his time, and his fears for the future. 
We must try to determine the extent to which 
the conflicts I have postulated lead to an aliena- 
tion of the individual from society and how we 
can prevent this from happening. We need to 
know more about how to prepare people to ac- 
cept change, particularly older workers. We 
must refine our tools for predicting future 
trends so as to properly prepare our young peo- 
ple for the world they will someday face. 

Carl Jung 5 once remarked that in science 
‘‘the individual man and, indeed, all individual 
events whatsoever suffer a leveling down and a 
process of blurring distorts the picture of reality 
into a conceptual average." In the scientist's 
search for lawfulness, he abstracts away com- 
plexity and reduces the individual to a statistic. 
So it is with all of the statistics which I have 
cited. The individual is lost. But if we look be- 
yond the abstraction to men, as living, breath- 
ing individuals, there is much to be concerned 
about in this affluent society of ours. 
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INTRODUCTION 

This paper will discuss a new mathematical 
technique for the solution of nonlinear dilferen- 
tial equations. The types of equations and non- 
linearities presented are those associated with 
feedback control systems. Originally developed 
as a method for simulating control system_s, the 
technique was verified during a recent study 
of complex aircraft design simulations. It is 
now being used in a man-in-the-loop Gemini 
spacecraft simulation. 

Since these design simulations include man 
as a component part, it is imperative that they 
be performed in real time. For many years, 
the analog computer with its parallel comput- 
ing ability has been used for the real-time de- 
sign simulation of aircraft. In more recent 
years, the analog computer has assumed the 
same role for missiles and space vehicles. Pres- 
ently, aircraft systems have become more com- 
plex, space vehicle missions require more hours 
of simulation time to complete a maneuver, and 
high-performance missiles require more logical 
decisions in their phases of flight. Because of 
these conditions, it has become more difficult 
to expand, use and maintain the large analog 
systems which are required to perform real- 
time simulation. 


Until the present technique was developed, 
a general purpose digital computer was unable 
to achieve a real-time solution for the more 
complex design simulations. The primary ob- 
stacle to the digital approach was the excessive 
computing time required for simulating the 
vehicle's control system and rotational motion 
equations by available numerical techniques. 
The roll, yaw and pitch channels of the aircraft 
used as an example are represented by fifteen 
linear and nonlinear differential equations in 
addition to algebraic equations. The highest 
input frequency is approximately three cycles 
per second; the maximum transient response 
frequency is 60 cycles per second. The solution 
time for these equations on the IBM 7090 by 
Runge-Kutta integration is 3.0 seconds for each 
second of flight time. When solved by the new 
technique, the same equations require only 0.17 
seconds of computation time for each second 
of flight time. This solution-time reduction, in 
addition to a reduction in the computation time 
for the vehicle dynamics equations, permit3 
a digital computer to perform the simulation in 
real time. 

This paper will describe the theory of the 
new technique as applied to the flight simula- 
tion problem. Autopilot and power controls 
equations, and vehicle dynamics equations will 
be used as examples. 
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Figure 1. Transfer Function Diagram for Aircraft Roll Channel. 


PROBLEM DEFINITION 

Figure 1 illustrates the Laplace transfer 
function diagram of an aircraft roll channel.* 
The yaw and pitch channels are of similar form 
and will not be discussed. This transfer func- 
tion diagram represents the mathematics of 
the vehicle's roll rate as a function of autopilot 
(^,.) input. Additional inputs are applied at 
point Xr,. The first two inner loops contain non- 
linearities in the form of limiting devices. These 
devices limit the magnitudes of the variables 
X 4 and Xh to 6.55 and 20.0, respectively. The 
first inner feedback loop is a high-frequency 
control unit which accepts autopilot signals as 
input. Its internal frequency response is up to 
60 CPS and drives the second inner feedback 
function. This second feedback transfer func- 
tion is a low-frequency unit and is the mathe- 
matical representation of the hydraulic unit 
of the roll channel. 

The next to last block gives the dilferential 
equation for the roll acceleration. The equation 
contains continuous nonlinearities in the form 
of the time-varying coefficients Ci, C„ and a, 
and the crosscoupling terms, r and q. The time- 
varying coefficients are functions of altitude, 
speed, angle of attack, etc., while the crosscou- 
pling terms introduce the effects of yaw and 
pitch on the roll of the aircraft. 

The problem of concern is the solution of the 
nonlinear differential equations for the roll 
channel (yaw and pitch also) at a solution rate 
which will give results that are accurate for a 
design analysis, and yet permit real-time opera- 
tion of the entire simulation. 

* Laplace transfer functions rather than differential 
equations will be used to specify the mathematical 
models. The use of the new technique to be discussed 
is greatly aided by starting with transfer functions. 


SOLUTION METHOD 

The nonlinear differential equations are 
solved by an appropriate set of difference equa- 
tions. These difference equations are derived 
by first computing a z transform! of the Laplace 
transfer diagram and then reducing the trans- 
forms to simple difference equations. The z 
transforms are used in a manner which differs 
considerably from their normal usage. By prop- 
erly computing z transforms for the linear 
portions of the transfer diagram, and then com- 
bining the linear portions as though they were 
separated by samplers, it is possible to arrive 
at a transformation which, when reduced to 
difference equations, will permit an accurate 
and rapid solution of the nonlinear system. 

Figure 2 is a feedback control system with a 
nonlinearity (the limiter) in the forward loop. 
The limiter prevents the magnitude of the vari- 
able X 4 (S) from exceeding the limit Ki,. The 
limiter behaves as a variable gain element. For 
the nonlimiting mode, the gain K is unity and, 
when limiting occurs, the gain is such that: 
X 3 (S)K = K,, = X 4 (S). When the system is 
operating in the nonlimiting or linear region, 
the forward loop transfer function is 
Gi(S)G 2 (S) and the system transfer function is 

Xr.(S) _ G.(S)G 3 (S) 

X.(S) “ 1 -h G,(S)G 3 (S)H(S)' 

For this nonlimiting mode, a z transform of 
the total transfer would be computed as 

G,(S)G 3 (S) 

1 + G,(S)G 3 (S)H(S)J • This transfer 

tz is the sampled data variable and is equal to 
where S is the complex Laplace variable and T is the 
sampling period. For further information, refer to 
sources listed in the Bibliography. 
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Figure 2. Feedback Control System with Limiting. 


function has the correct roots and steady-state 
gain for a single impulse input. For any other 
input, the steady-state gain would need adjust- 
ment. This would necessitate adjusting the 
numerator polynomial coefficients. The denomi- 
nator polynomial would be correct and would 
not need adjustment. 

For the limiting case, the forward loop trans- 
fer function would no longer be Gi(S)G2(S), 
but KGi(S)G 2(S) where K < |1|. The limiter 
acts as a variable gain element. When X;^(S) 
becomes larger than some predetermined value, 
limiting occurs such that 1X4(8) | — Kl. Thus, 
for a particular sampling instant and limiting 
gain Kl, the system is considered linear and 

Xr (S) 

the transfer function becornes = 

Xi (b) 

The resulting z trans- 

Gi(S)G2(S) 1 

iG(S)G(S)H(S)J ‘ 

Thus, for both the limiting and nonlimiting 
cases, the system at any sampling instant may 
be considered linear. This leads to the concept 
of a piece-wise linear system. The main prob- 
lem which remains is to determine when the 
limiting occurs, and it is here that a departure 
from standard z transform techniques is made. 


KiGi(S)G2(S) 


1 + KiGi(S)G2(S)H(S)* 


form would be Z 


.1 


Kl 


+ K 


Some provision for stopping and checking for 
the limiting case must be made. This can be 
done by considering each element of the system 
separated by a sampler as shown in Figure 3 . 

In this form, the calculations proceed around 
the loop, one transfer function at a time. On 
each pass through the forward loop, the test 
for the limit is made before proceeding with 
the calculations. In this form, the system 
transfer function for the nonlimiting case be- 

X,(z) Gi(z)G.(z) 

POTYIP^ ^ — — — — 

X,(z) - 1 + G,(z)G.(z)H(z) ’ 

which does not correspond with the nonlimit- 

ing transfer function Z [ i +G,'(S)fefs)H(S) ] 

calculated previously from the Laplace system. 
The roots and gain of the transfer function will 
not agree with those of the z transform calcu- 
lated from the Laplace transfer function. How- 
ever, by appropriate gain adjustments in the 
forward and feedback loops, it is possible to 
adjust the z transform so that the roots and 
steady-state gain match. The difference equa- 
tions obtained from this corrected z transform 
will give solutions which are within the accu- 
racy required for design simulations. 

Figure 4 shows the transfer function dia- 
gram of a feedback control network which will 
be used as an example to illustrate the tech- 
nique. 



Figure 4. Linear Feedback Control System. 



Figure 3. Sampled Data Feedback Control System. 


tThis equation is written in operator notation. The 
capital Z indicates the z transformation operation. 
Thusf(z) = Z [f(S)]. 


Since the system of Figure 4 is linear, an 
exact difference equation may be derived. The 
method consists of multiplying the Laplace 
transform of the input by the transfer function 
to obtain the output transform. The output 
transform is transformed into z transform 
notation. By dividing this output transform 
by the z transform of the input, the z trans- 
form transfer function is determined. This 
transfer function is then used to compute a 
difference equation which will give an exact 
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m 



Figure 5. Arbitrary Input Sampled by Zero- 
Order Hold. 


solution for the particular input used. How- 
ever, for real-time design simulations the in- 
puts are, in general, unknown and this method 
cannot be used for computing the difference 
equations. This difficulty is resolved by com- 
puting z transforms for step inputs or pulses 
of finite width, where the width of the pulse 
is equal to the sampling period. Any arbitrary 
input may be approximated by a stair-step in- 
put. 


Figure 5 illustrates a function which has 
been broken up into steps. 


This type of sampled input is obtained by a 
zero-order hold, and introduces, on the average, 
one-half of a sample period lag in the input. 
This lag is compensated by a lead which is 
characteristic of the z transform method. The 
transfer function of Figure 4 is 


X5(S) G(S) _ 5 

X,(S) ~ 1 -t- G(S)H(S) S* 4- 2S -}- 5 


The output transform for a unit step input 
is calculated by multiplying equation (1) by 

the unit step transform 


X.,(S) 


^ ( 2 ) 

~ X,(S)^ S “ S(S2 4- 2S + 5) 


Equation (2) is expanded by partial frac- 
tions so that standard z transform tables may 
be used. 


XflfS) - g^g2 2S 4- 5) ~ S 

^ + . i I (3) 

S2 + 2S + 5 S2 + 2S + 5 ^ ^ 

Equation (3) may now be transformed into 
z notation through the use of transform tables. § 
The sampling period used is T = 0.05 seconds. 


This rate is much higher than needed to pro- 
duce satisfactory results. The choice of a solu- 
tion rate is governed by the frequencies of the 
input function and transient response. The 
transient response frequencies are calculated 
from the roots of the system transfer function. 
To reproduce the response at this frequency, 
a sampling rate greater than but not less than 
twice the frequency should be used. Since an 
arbitrary input function is sampled in a stair- 
step manner, a sampling rate must be chosen 
to minimize the error introduced by the stair- 
step approximation. This error may be made 
insignificant merely by increasing the solution 
rate. This error is not present for step inputs 
since the difference equations are derived for 
step inputs. Thus, for step inputs, the solutions 
are exact at the sampling instant, regardless 
of the solution rate. A general criterion for 
choosing a solution rate is to pick one which 
will give satisfactory simulation results. The 
rate is much less than required for standard 
numerical techniques. 

Y y X z z2 — e— cos (.l)z 

AsUj _ z 2 — 2e-®5 cos (.l)z+e-i~ 

5e— .05 gin (.i)z 

z2 — 2e-*®^ cos (.1) z+e— ' 


V . V z z2 — .94648Z 

— z2_i g9295z_^.904837 “ 
.04748Z 

z2— 1.89295Z+.904837 


Xs(z) = — 


z2 — .8990z 


( 6 ) 


z z2— 1.892954-.904837 

The system transfer function in z notation is 
computed by dividing equation (6), the output 
z 


response, by 


z — 1 

z transform notation 


, which is the step input in 


Xstz) X..(z) _ 

X,(z) - _j_ - 
z — 1 

z2— 1.8990Z4-.8990 
z2— 1.89295Z-I- .904837 ^ ^ 


Xstz) _ .00605Z4- .005837 

X,(z) Z2—1.89295Z-H. 904837 

The difference equation is computed by cross 


§ Refer to appendix for a list of basic z transforms. 


NEW DIFFERENCE EQUATION TECHNIQUE FOR SOLVING NONLINEAR EQUATIONS 173 


multiplying both sides of equation (8) and solv- 
ing for Xr>(z). 

z2Xr.(z) — 1.89295zX,(z) -f- .904837Xn(z) = 
.00605zX,(z) -t- .005837X,(z) 

(9) 

Xr.(z) = .00605 .005837 

z z 

-f 1.89295 — .904837 

' z z^ 

( 10 ) 

In z transform theory, represents a time 

z 

delay of one sampling period ; represents a 

z 

delay of two sampling periods ; and, in general, 

is a time delay of n sampling periods, or nT. 
z 

The inverse transformation of equation (10) 
back to the time domain thus becomes : 

X 5 (nT) .00605Xi(nT— T) + .005837Xi 

(nT— 2T) + 1.89295XanT— T) —.904837X5 
(nT— 2T) (11) 

Equation (11) gives an exact solution at the 
sampling instants, nT, for the output of Fig- 
ure 4 for step inputs (or for finite- width pulse 
inputs of width equal to the sampling period). 

A new method of calculating the difference 
equations will now be discussed. The new tech- 
nique is required for the case where a limiter 
is present. The same system of Figure 4 will 
be used. Normally, a limiter woud be present 
between the two elements in the forward loop 
of Figure 4. However, for the initial analysis, 
the nonlimiting mode is used and the limiting 
element can be replaced by an element of unity 
gain. The method, as stated before, is to com- 
pute z transforms for the linear portions, com- 
bine these portions for the overall transfer 
function, and then make appropriate gain ad- 
justments such that the steady-state gain and 
roots agree with the linear gain and roots. The 
transforms for the linear portions are calcu- 
lated directly from the transform tables with 
one modification. The transforms in the tables 
are transforms for a single-impulse input. Here, 
step inputs are used which, through the sam- 
pling process, become a train of impulses. The 
transforms for the single-impulse input do net 
give a correct value of steady-state gain for 


the impulse train. This difficulty can be resolved 
by using impulses of area T, the sampling 
period, or by multiplying the transform by T 
and using the unit impulse. The multiplication 
by T is a first approximation. The numerator 
will generally require further adjustment. For 
an integrator, the numerator is multiplied by 
T and no further adjustment is required. The 
integrator adjustment may be explained by re- 
ducing the sampling period. The z transform 
is a discrete process which approaches the La- 
place as the sampling period becomes smaller. 
Consider the Laplace transform for integration 

From the tables, the corresponding z trans- 
it 

2 

form is :r which is multiplied by T to get 

z — 1 

Tz 

- — J-. The sampling period is reduced. 

lim Tz _ lim Te*^'*’ 

T-»0 z— 1 ~ T-»0 e^’’— 1 ’ 

T(1+ST+S^T^+ . . .) 

lim Tz _ 2 

T-»0 z— 1 (l-fST-KS2T2-)- ...)— 1 ^ ’ 

2 


lim Tz T+ST2-i-l/2S2T»-f- . . . . _ 1 

T-^0 2^1 ~ ST-!-l/2S2T2-t- S 

(14) 

As the sampling period is reduced to zero, the 
z transform for integration becomes which 

is the Laplace integration transform as ex- 
pected. However, if the T had been omitted, 
the limit would have been infinity which is in- 
correct. The z transforms for the linear por- 
tions of Figure 4 are calculated from tables. 

The calculations for the numerator coefficient 
of equation (15) may be omitted and a constant 
K inserted. This coefficient will undergo a gain 
adjustment when the roots are corrected. 


_5 ^ T5z 

S-l-2 z — e— 1 


.25z _ 

z— .904837 ~ 

1151 

z_.904837 ^ ’ 


L Tz _ .05z 
S z — 1 z — i 


(16) 


The gain adjustment could have been made in 
the integrator coefficient with the same output 
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Figure 6. Sampled Data Feedback Control System — 
Loop Gain Unadjusted. 

results. The only effect would be a time shift 
in the response within the loop. The limiter 
would limit at a slightly different time. Figure 
6 shows the transfer function at this point. A 
delay of one sample period has been inserted in 
the feedback loop. Since the numerator and 
denominator of the forward loop transfer func- 
tion are the same degree, the corresponding 
difference equation relates the present value 
of Xr*(z) to the present value of X 2 (z). How- 
ever, X^Cz) is equal to Xi(z) minus X.-;(z). 
Therefore, without a delay in the feedback, the 
computer would be faced with the problem of 
calculating a number which depends on itself. 
This problem is avoided by inserting a delay 
of one sampling period in the feedback loop. 
With the delay added, past values of Xr,(z) are 
used to calculate the present value of Xr,(z). 

The transfer function of Figure 6 is 

X.(z) G(z) 

X,(z) ~ l+G(z)H(z) ~ 

( 1 „ 

z'+(05K— 1.904837)2+.904837 ' ' 

The coefficient for z in the denominator poly- 
nomial of equation (17) contains the unknown 
gain constant K. This constant originated from 

5 

equation (15) where the z transform of 

was calculated. The reason given for keeping 
the numerator of equation (15) a constant K 
was that this numerator coefficient would under- 
go an adjustment when the roots were cor- 
rected. The denominator roots of equation (8) 
are the correct roots for the system of Figure 
4. Since the roots determine the transient re- 
sponse of a system, the denominator of equa- 
tion (17) must match the denominator of 
equation (8) if the system of Figure 6 is to per- 
form as the original system of Figure 4. The 


denominator of equation (17) is matched to the 
denominatoi* of equation (8) by solving for the 
gain constant K. 


.05K 

— 1.904837 ^ —1.89295 

(18) 


.05K = .011887 

(19) 


K = .23774 

(20) 

X,(z) 

.011887z2 

(21) 

Xi(z) 

“ z2—1.89295z+. 904837 


Equation (21) is the final transfer function. 
It has the same denominator as equation (8) 
and the numerator coefficient is equal to the 
sum of the numerator coefficients of equation 
(8). Since the sum of the numerator coeffi- 
cients of equation (8) equals the numerator 



Figure 7A. Linear Feedback Control System. 

coefficient of equation (21), the steady-state 
gain of (21) is equal to (8) as desired. The 
numerator is seen to be of an order higher 
than equation (8). This introduces a lead in 
the system output. 

Figure 7 shows a comparison of the output 
response of the system of Figure 4 to a pulse 


Xl(t) 



Figure 7B. Input Pulse. 
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Figure 7C. Output Pulse. 

input for the two methods of solution. Part A of 
the diagram shows the system; Part B repre- 
sents the pulse input; and Part C gives the out- 
put responses. The results are for the nonlim- 
iting mode. The solution which uses standard 
z transform techniques agrees exactly with the 
continuous solution given by equation (22). 
Equation (22) was found from the inverse 
transformation of equation (2). The input no- 
tation Xi(t) has been changed to unit step no- 
tation U(t). 

X.,(t) = 2U(t) — 2e-»cos (2t)— e-' sin (2t) 

— U(t— 1/2) 

[2— 2e-*+v2 cos (2t— D— e-'+''2 sin (2t— 1)] 

( 22 ) 

The solution as calculated from the difference 
equations derived from Figure 6 leads the exact 
solution, as expected, since a pulse input was 
used and no delay in step or pulse inputs is in- 
troduced by the sampling process. This lead 
may be programmed out, or may be desirable 
in cases where instruments are driven in a 
simulation. For an arbitrary input, the stair- 
step approximation introduces a delay in the 
input which cancels all or much of the lead due 
to the solution technique. 

Figure 8A is Figure 4 with the limiter added 
to the forward loop. The limiter limits the peak 
magnitude of the variable X4(S) to unity. Fig- 
ure 8B is the z transform of Figure 8A. It is 
the same as Figure 6 except for the addition 
of the limiter and the value for K. Curve B 
of Figure 9 is the difference equation solution 
of the system of Figure 8B for the pulse input 



Figure 8A. Laplace Feedback Control System 
with Limiter. 


shown in Figure 7B. Curve A is the solution 
of Figure 8A for the same input pulse. This 
solution was achieved by Runge-Kutta integra- 
tion at a solution rate of 2,000 solutions per 
second. The difference equation solution leads 



Figure 8B. Z Transform Feedback Control System 
with Limiter. 


the Runge-Kutta solution but has the same am- 
plitude and frequency. The difference equa- 
tions were written from the linear portions of 
Figure 6 such that the computer would calcu- 



Figure 9. Output Response for a Pulse Input for 
Circuit Shown in Figure 8. 


late around the loop a step at a time. This 
step-wise solution was necessary to check for 
the limit. The equations used were derived 
from Figure 8B. 
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X 2 (nT) = Xi(nT)— X 5 (nT— T) (23) 

X 3 (nT) = .23774X2(nT) + .904837X3 (nT—T) 

(24) 

X4(nT) = LIMIT 111 X3(nT) (25) 

Xr,(nT) = .05X4(nT)+X5(nT— T) (26) 

Figure 10 shows, in chart form, the effect of 
limiting on the system roots. A comparison is 
made between the root loci of the continuous 
system and the z transform system for several 
values of limiter gain. There are two z trans- 
form root loci ; one is for a sampling period of 
T = 0.05, and the other is for a much slower 
rate of T = 0.2. The two z transform root loci 
correspond very closely to the Laplace for values 
of limiting gain near unity. This is true be- 
cause the z transform equations were matched 
for the nonlimiting case K = 1. For smaller 
values of limiter gain, the roots of the z trans- 
form deviate slightly from the Laplace roots. 
This produces a small shift in the transient 
response frequencies. 


Limiting Gain 

Root: S = R 

e 

± jl 

m 


Laplace and 
z Transform, T = .05 

z Transform 

T= .2 

1. 000 

-1. 0 ± 3 2. 00 

-1.0 ±j 2,000 

.922 

-1,0 ±i 1.90 

-1.0 ±J 1.899 

.848 

-1.0 ± j 1.80 

-1.0 ±j 1.797 

.778 

-1.0 ± j 1.70 

-1,0 ±j 1.696 

.712 

-1.0 ± j 1.60 

-1.0 ± j 1.595 

.650 

-1.0 ± j 1.50 

-1. 0 ± j 1. 493 

.592 

-1.0 ±j 1.40 

-1.0 ±j 1.393 

.538 

-1.0 ±i 1.30 

-1.0dk j 1.292 

.488 

-1.0 ± j 1,20 

-1.0 ± j 1.191 

.442 

-i.a± j 1. 10 

-1.0 ± 3 1. 091 


Fi^re 10. Root Locus Comparison. 

ROLL CHANNEL EXAMPLE 

This example will explain the important 
features of the z transformation of the roll 
channel of Figure 1. Figure 1 is expressed in 
Laplace notation, except for the roll accelera- 
tion (p) equation. Before the z transform is 
computed, this equation is placed in Laplace 
notation. The equation for p contains time- 
varying coefficients and crosscoupling terms 
from the yaw and pitch channels. These cross- 
coupling terms have been expressed as a func- 
tion of time only; however, they are mutually 


dependent functions. Thus, rolling of an air- 
craft will affect its yaw and pitch rates. Be- 
cause of this interdependence of the cross- 
coupling terms, the equations for roll accelera- 
tion (p), yaw acceleration (r), and pitch ac- 
celeration (q) form a system of nonlinear dif- 
ferential equations. For analysis purposes, 
each of these equations may be analyzed sep- 
arately for a predetermined region of flight 
by simply considering the crosscoupling terms 
as additional inputs to the transfer function. 

p(t) = Ku(t)f(t)q(t) [r(t) -I- p(t)q(t)] 

”1“ I^i 3 (t)q [^1 Cos o! — C„ ijSin a] 

(27) 

Since the transfer function for each equation 
is matched to give correct transient response 
and steady-state gain, the total system of equa- 
tions will behave properly because the cross- 
coupling terms are treated as additional inputs 
to each channel. Equation (28) is the result 
of simplifying equation (27) by choosing an 
optimum region of flight. 

p — 65.20f -|- Ki^rq -|- ICi2r -j- ISd, — 12.2p 

(28) 

The flight region was chosen near the high 
performance end. This choice was made so 
that the coefficients in equation (28) would be 
near these maximum values. This enables the 
analysis of the root locus for the overall feed- 
back system to be made at maximum loop gain. 
For a lesser performance region, the coefficients 
would be reduced. This would cause a reduction 
in loop gain and a shift of the operating point 
of the root locus. As previously shown, a good 
match between the actual system root loci and 



Figure 11, Transfer Function Diagram for Aircraft 
Roll Rate. 
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Figure 12. Roll Channel Z Transform Transfer Function (Roots and Steady-State Gain Unadjusted). 


the z transform root loci is possible for lower 
loop gains. 

Figure 11 is the transfer function diagram 
of equation (28). The diagram shows the cross- 
coupling terms as inputs. 

The transfer function of Figure 11 is used 
in place of the portion from to p in Figure 
1. Once this substitution has been made, the 
roll channel transfer function is transformed 
into z notation. This transformation of the 
forward loop proceeds one inner feedback loop 
transfer function at a time. The procedure for 
transforming each inner loop is the same as 
discussed previously and will not be repeated. 
A solution rate of 20 solutions per second was 
chosen. This rate gives satisfactory results and 
permits a real-time solution. Figure 12 is the 
resulting z transformation. 

After performing the z transformation of 
the individual loops of the system, there still 
remains one final loop gain adjustment. This 
adjustment is necessary to match the roots and 
gain of the z transform system to the Laplace 
system. The adjustment in loop gain is deter- 
mined by comparing the root locus of the z 
transform with the root locus of the Laplace 
transfer function. A root locus program II was 

II The Control System Analysis Program computes 
root loci for linear, continuous or sampled data systems 
in open-loop form. The input data may be in either 
S or z transform notation, or any combination of these, 
and may contain transportation lags. Options include 
z transform computation pole-zero loci, and gain loci for 
a specified gain. 

The method of computing the root locus is described 
in the paper “Numerical Methods for the Synthesis of 
Linear Control Systems” by Maurice E. Fowler. The 
paper was published in the 1963 Volume 1 of Auto- 
matica, pp. 207-225. 


used which computes points on the root locus 
for various values of gain (K) which the pro- 
gram inserts in the loop. The gain adjustment 
for the z transform system is determined by 
finding the roots of the Laplace system for 
K = 1, and then comparing the gain of the 
z system at the same point in the S plane. 

Figure 13 is a plot of the two root loci in the 
S plane. 



Figure 13. S-Flane Root Locus Plot with Gain Factor 
(K) as a Parameter. 

Since the computer did not calculate roots for 
K = 1, a linear interpolation was used to deter- 
mine the K — 1 point. The resulting root is 
S = — 19.917 -f j 16.210. The adjacent z root 
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Figrure 14. Roll Channel Z Transform Transfer Function (Roots and Steady-State Gain Adjusted). 


locus does not have the same real part due to 
the characteristic lead introduced by the 
method; however, a close match may be ob- 
tained by using the imaginary part of the root. 
For ImS = 16.210 on the z locus, the gain is 
determined by linear interpolation to be K = 
0.905. Since this gain should have been unity, 
the z root locus was matched to the Laplace by 
reducing the z loop gain by a factor of .905. 

Figure 14 is the final z transform of the roll 
channel. The loop gain adjustment was made 
in the forward and feedback loops. This was 
done to adjust the steady-state gain of to p, 
and also to obtain the correct system roots. 

Figure 15 shows in chart form, the Laplace 
and adjusted z transform root loci. Values of 
gain less than unity may be thought of as a 
reduction in loop gain due to limiting in the for- 
ward loop. The values of gain shown in the 
chart are not the actual loop gains, but are 
multiplicative gain factors inserted in the loop. 


Laplace 


z Transform 

T = .05 

Root 

S=R ±jl 
e m 

Root 

S = R ±3l 
e ra 

R 

I 

Limiting Gain 

R 

I 

e 

m 

K 

e 

m 

-19.917 

16.50 

1.029 

-20.323 

16.52 

-19.917 

16.00 

.979 

-20.308 

15.98 

-19.933 

15.50 

.931 

-20.308 

15.46 

-19.948 

15.00 

.884 

-20.292 

14.91 

-19.948 

14.50 

.839 

-20. 292 

14. 38 

-19. 964 

14.00 

.795 

-20. 277 

13.84 

-19.964 

13.50 

.753 

-20. 277 

13.32 

-19.980 

13.00 

.712 

-20.261 

12.79 

-19.980 

12.50 

.673 

-20.261 1 

12.27 

-19.995 

12.00 

.635 

-20.245 

11.76 


Figure 15. Root Locus Comparison. 


If, for example, the actual loop gain was 10, 
then for K = .90, the system would be behav- 
ing as one with a loop gain of 9. 

One last point worth mentioning is that the 
root loci of Figure 13 are only a portion of the 
loci of the Laplace and Z transform systems. 
Actually, the z transform roots repeat them- 
selves up and down the imaginary axis at multi- 
ples of 2jr/T. It is only necessary to match the 
primary roots of the z transform. The other 
multiple roots will automatically be matched 
when the primary roots are matched to the 
Laplace roots. 

CONCLUSION 

The difference equation technique described 
in this paper has been successfully applied to 
aircraft and spacecraft simulations. In these 
simulations, the difference equations were used 
primarily to describe the mathematical be- 
havior of both linear and nonlinear control sys- 
tems. A reduction in the computing time for 
simulating these control systems was achieved. 
This reduction ranged from a factor of 15 to 20. 

The results of this technique may be made 
to agree almost exactly with those obtained 
from standard Runge-Kutta integration tech- 
niques merely by increasing the solution rate. 
However, increasing the rate will increase the 
computation time which may make the real- 
time criterion difficult to achieve. The air- 
craft simulation was solved at a rate of 20 
solutions per second. The maximum instantane- 
ous deviation from Runge-Kutta solved at 
2,000 solutions per second for the output of the 
roll, yaw, and pitch channels was about 0.5%. 
The steady-state values agree almost exactly. 

The stability of the simulation is known be- 
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fore the simulation is actually performed. This 
preliminary knowledge is gained, of course, 
from the root locus plot. 

The difference equation technique may be 
applied to a wide range of control problems. 
Other possible applications for the technique 

APPENDIX— BASIC z TRANSFORMS 


outside the aerospace field are hydrofoil boats, 
submarines, and industrial control applications. 
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f(t) 

F(S) 

F(z) 
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u(t) 

1 
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S 
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1 
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2 

S* 
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1 
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SINE 
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7 ? — (cos bT)z 

S2 + b* 

2 ? — 2 (cos bT)z 1 

DAMPED SINE 

i-» — at oiTi Vvi" 

b 

e— “‘ (sin bT)z 

c sm Dt 

(S + a)2 + b2 

z8 — 2e“'“'''(cos bT)z -|- 

DAMPED COSINE 


S + a 

z8 — e"'*'(cos bT)z 

e cos ux 

(S + a)2 + b2 
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I. INTRODUCTION 

When controlling the performance of a sys- 
tem, it is often desirable to choose the control 
that will minimize errors in the system and do 
it in the shortest possible time. A practical 
matter that must be considered in the optimi- 
zation in relation to rapid action is the fact that 
control is of a bounded nature. In a great many 
important cases, the constraint on the magni- 
tude of the control effort precludes the use of 
classical variational techniques to design the 
controller. 

In 1956 Pontryagin hypothesized his ‘‘maxi- 
mum principle’’ which has since been proven a 
necessary condition for the optimization of 
linear systems in relation to rapid action.^ 
In solving the minimum time problem for linear 
systems with bounded control, the principle 
leads to a “bang-bang” form of control law. 
This implies that the control effort is always 
being applied at its maximum value. There re- 
mains, however, the task of finding the opti- 
mum time to switch the control. Pontryagin’s 
method leads to a rule for switching the con- 
troller which is a function of the initial condi- 

* Supported in part by the Office of Naval Research. 


tions in the system adjoint to the one being 
controlled. Generally these initial conditions 
are difficult to find. 

It is usually helpful to consider the control 
problem using state space techniques. The co- 
ordinates of the space for an n^*“ order system 
here are a displacement error and its n — 1 time 
derivatives. The space may be divided into two 
regions each of which is characterized by the 
control optimal for the trajectories in that re- 
gion. Optimum switching between the two con- 
ditions of the bang-bang control occur on the 
hypersurface dividing the space. The switching 
criteria can then be stated as a function of the 
state space variables. 

Of considerable value in finding the switching 
surface is the system adjoint to the system. The 
adjoint can be thought of as the system running 
in reverse time. By plotting trajectories from 
the origin of the error state space “backwards” 
in time, with the control satisfying the respec- 
tive adjoint variables, a surface is generated 
which may be related to the optimal switching 
surface in the system state space. 

A problem of interest occurs when the sys- 
tem is of such a nature that when control is ap- 
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plied, a discontinuity appears in one or more 
of the system states. This may happen when 
the control is of a bang-bang form and the for- 
ward transmission path of the system contains 
zeros. It could also show up if the control is of 
such a form that it approximates an impulse 
to the system. When there are discontinuities 
in the state space due to switching it is gener- 
ally no longer possible to write the switching 
criteria as a function of the state space vari- 
ables. 

One alternative might be to switch the con- 
trol as a function of time. This may be done 
effectively when the number of switchings to 
reach the origin of the error state space is no 
more than n — 1 in an n*’‘ order system. Such a 
restriction limits one mainly to considering 
only those systems with real, distinct eigen- 
values. Large disturbances in lightly damped 
(oscillatory) systems may require more than 
n — 1 switchings to zero the error states. The 
most important consideration when controlling 
as a function of time is the means of implement- 
ing the switching logic. To accomplish time 
dependent control, it is virtually mandatory 
that a digital computer be inserted in the con- 
trol loop. 

Another approach to the problem is to find a 
system that reacts identically to the system 
with zeros except at the points of discontinuity. 
Control of this parallel system can be stated in 
terms of the state space variables. This logic 
can then be used to switch the original plant. 

This paper will be an investigation into the 
latter method. The problem is as follows : 

Given a second order oscillatory system 
with one zero, find the optimum control for 
zeroing the errors in the system in minimum 
time and for zeroing the errors with mini- 
mum fuel. 

The method of Pontryagin is used to solve the 
problem. The brief description of the method 
presented here is based on the work of Rozo- 
noer.^ 

II. PONTRYAGIN’S MAXIMUM PRINCI- 
PLE 

Given the system state variables described 
by n first order differential equations 

Xi zzr fi {x,uX) i= 1, ....... n (1) 

where x is sl column vector in phase space and 


w is a column control vector consisting of r 
control elements. 


The control u(t) must belong to a closed sub- 
set U of admissible controls and must be piece- 
wise continuous. The trajectory x(t) in the 
phase space is uniquely determined by (1) when 
control u(i) and the initial conditions 


f x<^ 1 
1 


x(0) =z x^ = 


( 2 ) 


I"! 

are given. 


The control u(t) of a system may be consid- 
ered optimum under a variety of criteria. A 
large class of optimization problems may be 
solved by presenting the criteria in such a way 
that the solution is attained by minimizing a 
linear function of the final value of the state 
space variables. A control must be selected 
from U that will transfer the system (1) from 
x^ to some fixed closed set G of the phase space 
such that 

n+I 

Sr= ^ CiX,(T) (3) 

1 

is a minimum. The constants Ci and the x„+i 
coordinate are chosen such that minimizing 
(3) optimizes the system. 

In a great many cases optimization of only 
one of the coordinates of the system is desired. 
For example, in order to optimize the magni- 
tude of 

T 

j F{x(t),u{t))dt (4) 

0 


for T and a;(T) either fixed or free in a system 
(1) for u(t)«U, a new variable is introduced: 

jpix it), u{t))dt (5) 

0 

X®„+1 = 0 

and another differential equation 
x.,+1 = F(a:(t),M(t)) 

is added to (1). The problem of optimizing the 
integral leads to optimizing Xn+i(T) at t=: T. 
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Minimizing x„ + i(T) in the system (1) with 
x„+i(t) adjoined is accomplished by putting the 
problem in functional form (3) and applying 
the maximum principle to gain the solution. 
That is 

n+1 

S= ^ c,x,(T) =x„+,(T) (6) 

1 

is the functional to be minimized. Here 
it may be seen that Ci = C 2 = . . . , Cn = 0 and 

Cn + 1 1 , 


A new dependent variable p(t) is now formed 
such that 

n+1 

Pi(t) — — ^ p„ 

1 

The function 

n+1 

H= ^ Psfs(a;,M,t) (8) 

1 

is introduced from which equations (1) and 
(7) may now be written 


dXi 

i = I n+1 (7) 


Xi = 


an 

api 


Pi = 


an . 


axi 


i = 1, 


,n+l (9) 


The control u*{t) is said to satisfy the maxi- 
mum condition if H(x*(t),p*(t),M*(t)) reaches 
an absolute maximum at each time t 
(0^ t ^ T) where x*(t) and p*(t) are the 
values of the variables at time t with u* (t) e U 
controlling. For linear systems of the type dis- 
cussed in this paper, the necessary and sufficient 
condition for minimizing 
n+1 

S = ^ CiX*(T) 

1 


optimally with admissible control is that the 
control satisfy the maximum condition. 


To use the maximum principle, H is formed 
and maximized with respect to tt(t). This pro- 
duces a 

u*it) = 4 ,(x,p) (10) 


which may be used with Equations (9) and the 
boundary conditi ons to find u*(x). If the end 


point of x(t) is not fixed, it becomes necessary 
to obtain boundary conditions on p(t) in order 
to arrive at a solution. The conditions p(T) 
may be found using a function F(x) =0 which 
describes G and x'(T) € G, the end point of an 
optimum trajectory. The form of p(T) will be 
stated without detailed explanation; however, 
it may be noticed that at time t = T, p(T) is 
orthogonal to a hyperplane 
n+1 

^ a,(Xi — Xi) = 0 

1 

through the endpoint of the trajectory and 
directed toward that portion of G where 

n+1 n+1 

C=Xi^ ^ C;X, (T). 

1 1 

The coefficients at may be expressed as a linear 
combination of the C| and bi(x'(T)), the latter 
being coefficients of a hyperplane through 
x*(T) bracketting G. Thus 

Pi(T) = — ACi — ;itbi(x'(T)) (11) 

where A and p. are non-negative numbers one of 
which may be set equal to unity as it is only 
the ratio that is important. 



Generally, three situations arise as to final 
boundary conditions. 

(i) If Xi(T) are specified for i = 1,2 

m then these become the boundary con- 
ditions for (9). 

(ii) If Xi*(T) are internal points of G for 
i(l ^ i ^ n+ 1) then bi(x' (T)) = 0 
and Pi(T) = — Cj. 

(iii) If Xi)(T) are boundary points of G for 
some i(l ^ i ^ n+1) then F(x(T)) = 0 
andthept(T) are as in (11). 


When F is differentiable, the bracketting 
hyperplane through x‘ has coefficients 


b;Xl(T)) 


aF 


axi 


Xi 


Xi(T) 


( 12 ) 


If finding the optimum control for minimum 
transit time another condition must be fulfilled 
since T is not fixed beforehand. This condition 
isthatH(T) = 0. 
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III. DEVELOPMENT OF SYSTEM EQUA- 
TIONS 

The equation of a second order system with 
zeros may be written 

‘c + 2Cojc + a>2c — aiU + a.u (13) 
where c is the output variable of the system 
and u is the output of a controller. 

This paper is concerned with control of simi- 
lar systems that are purely oscillatory in nature, 
i.e., E^ =; 0. To facilitate ease of computation in 
the analysis, Equation (13) is scaled to 

c + c = aiU -f- u (14) 

which when written in terms of the Laplace 
transform of the output variable becomes 

c(s) (15) 

s 1 

This system is represented in block diagram 
form in Fig. 1. 



fnuTttn 

u 

ais+J_ 



s*+l 


Figure 1. Block Diagram of Control System. 

The response of the system to a step input is 
investigated more readily by means of the error 
variable 

e=:r — c (16) 

If the input r is fed forward as in Fig. 2, the 
Laplace transform of the error, given 



1 


Iqy- 

CONTROL 

-ua\— 

a i-f 1 


s*+ 1 


IT 


Figrure 2. Controlled System with Input Fed Forward. 

C(0) = C® 

C(0) = C® 

R(S) = r„/s (17) 

becomes 
E(s) = 

(rii — c®)s — (c® -|- airo) — (ais-|- l)U(s) 

sM^^ 


Now the problem of zeroing the error states 
reduces to that of zeroing the error initial con- 
ditions in the system. 

Finally with the introduction of state space 
variables 

e^ = e, (19) 

the system equations can be written in vector 
matrix notation 


IV. THE MINIMUM TIME PROBLEM 

The problem is stated as follows : 

Given the system (20) and a control force of 
bounded magnitude |u| ^ 1, find the optimum 
control «*(t) to transfer the state variables 
from some initial point in the phase space to 
the origin of the phase space in minimum time 
T. 


That is, given 

e(0) = e" 

e(T) = 0 (21) 

|nl ^ 1 

and the system (20), find M*(t) such that 
T 

S = f adt (22) 


is a minimum where a is a positive constant. 


Introduce 


T 

e„+i = = S = j a dt 

0 

The system equations then become 



' 0 1 01 


0 

e = 

—10 0 

e + 

Sli U — j- U 


- 0 0 0, 


la 


Because of (21), the functional 


3 

S= Y 
1 


(23) 


(24) 


(25) 


( 18 ) 
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and since we wish to minimize this, — 1 is 
chosen. ea(T) is not limited, hence the boundary 
condition becomes 

P,(T) :=_Ch=..— 1 (26) 

By ( 8 ), the hamiltonian becomes 

H = Pie. — p.ei +P:> (aiu + u) + p^a (27) 
Since 


it is evident that pa is a constant and therefore 
P;^ = p:4T) = 1 and now H is 

H = Pi 6 . — p^ei + P 2 (aiU + u) —a (28) 
which is maximized in u if 

aiU + u = N[sgn P 2 ] (29) 

where N — max |aiU + u| for each fixed 
t(0 ^ t ^ T). The control u*(X) which satis- 
fies these conditions is a “bang-bang’' type con- 
trol where u — 1 at all times and u at the mo- 
ment of switching is unbounded. 

Since has served its purpose in the optimi- 
zation process, we may now return to the sec- 
ond order system and solve for the “impulse” 
variables. By (9), 

<30) 

and the solution for u* (t) becomes 

u*(t) =i 1 • sgn[cos(t + ©)] (31) 

where © is a phase angle dependent on 

Several properties of the optimum controller 
are now known. First, the control is a bang- 
bang type which applies maximum effort at all 
times in one of the two “directions.” It is 
switched periodically from one state to the 
other every half cycle until the origin is 
reached. Notice that each time the control is 
switched, a discontinuity appears in the 62 
variable. This occurs because u contains an 
impulse. 

ts + 

Ae. — J ( — ei -f- aiU -{- u) dt ^ 
t- 

ts 

j a, udt = ai[u(t, ) — u (t, )] 
t»- (32) 


where Ae. is the discontinuity in e. at the time 
of switching ts. 

One would now like to find a switching curve 
L(e) which divides the phase plane ei vs. e. in 
such a manner that control u* — +1 is optimum 
in the space to one side of the curve and u* 

— 1 elsewhere. Control would be switched when 
the trajectory c*(t) crosses the curve. The dis- 
continuity Ae. precludes this possibility. For 
example, examine the trajectory e*(t) for some 
initial conditions that dictate u* = — 1 for op- 
timum control. At the point where this trajec- 
tory crosses L(e) the optimum becomes u* — 
+1. The control switches and Ae. = +2ai 
occurs which places the states back in the space 
where u* = — 1 was optimum. Here the con- 
trol switches again, Ae., — — 2a, occurs and 
chatter motion begins. The fact that e. is mul- 
tiple valued at the instant of switching makes 
a simple realization of h{e) impossible. 

For periods between switchings where u = 0, 
the system is well behaved with the solution 
for the k‘^' interval 

(f ) — — K^cos(t (^k) — 3 

ei(t) = K<pcos(t -(- <pk “h Tr/2) (33) 

w^here 8 = 1 - sgn p. and K, <^k depend on con- 
ditions of states at the start of the k^*' interval. 


4.1 The transformed variable 

The search for a variable of the system on 
which to control leads to the possibility of “sub- 
tracting out” the discontinuity present in e 2 at 
times of switching. 


The Laplace transforms of the system vari- 
ables are 

eiS -|- e 2 (ajS — l)U(s) 


Ei(s) = 


s^ + 1 


(34) 


E 2 (s) 

where 

U(s) = 8 ^ 


©2S ““ ©] “I” s (^iS — |- 1 ) U ( s) 
s 2 -f 1 


1 

s 


s s 


) 


(35) 

which for any instant of time t( 0 ^ t < ti) 
U(s) =1 (36) 

o 
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Equations (34) then become 


E,(s) 


e?s^ + (eg + aia)s + S 
s(s2 -|- 1) 


E2(s) 


(©2 “1“ aj8)s -|- (— e? -f- 8) 

s^ + 1) 


(37) 


By means of the initial value theorem, it is 
seen that 

lime](t) = limsE,(s)=ei 
t->0 s->«> 

(38) 

limei.(t) = lim sE 2 (s) = e^. + ai8 
t-^0 s^“ 


At time t = 0, e^ jumps to Ci. 4- ai-S To remove 
this discontinuity consider the transformed 
variables 

Y,(S) E,(S) 

(39) 

Y.(s) = E,(s) - ^ 

o 

By virtue of (39) and (20) 
sY,(s) =sE,(s) =E,(s) .= Y,(s)+ 

(40) 

sY,(s) =sE.(s) — a, 8 = — Y,(s) + - 

s 


4.2 Boundary conditions and final control 

From (38) and (39) it is clear that the initial 
conditions on the e and y variables are identical. 
From (39) it is also seen that 

y,(T) = e,(T) 

ya(T) =e,(T) — ai8(T) (44) 

At this point in the pursuit of the optimum con- 
trol, it becomes necessary to investigate the 
system action possible at time t = T under ad- 
missile control. ACi. of (32) provides a means 
of changing the value of 62 instantaneously by 
an amount dictated by the constraints on u(t). 
With this in mind, it is noted that appropriate 
use of Ae 2 within the bounds of allowable con- 
trol may zero the e 2 variable in zero time given 
that e 2 (T) is within range. The conditions (21) 
and (44) with (32) indicate that for 

iy2(T)|^a, (45) 

the system (20) may be zeroed instantly.* The 
boundary conditions on (41) then become 
yi(0) = y, = e, i = 1,2 

y,(T) = 0 (46) 

iy2(T)| ^a, 

The final controller U 2 (T) that must zero the 
errors for t > T has two conditions imposed 
upon it, i.e.. 


or 

where 8 is a unit step function with sign to be 
determined. The system (41) is identical to 
that of (20) except for the action at time of 
switching. It should be noted, however, that 
care must be taken in assigning final values to 
the system described by (41) if the two plants 
are to be controlled in parallel. The final value 
theorem and (39) gives 

lim Y 2 (t) = lim sY 2 (s) = lim s^E 2 (s) — 
t->oo s->0 s-^0 ^ 

= -a, 8 (42) 

From this it is observed that zeroing the final 
states in (20) is analagous to zeroing y, (T) 
and attaining a final value 

y2(T)=— a,8 (43) 

in the system (41). 


aiU2 U 2 = 0 

U2(T)— 8(T) = 
The solution to (47) is 


(47) 


U2(t) = — ^^exp ^ t — T 


(48) 


It is assumed then that u^Ct) is available at 
time t — T so that the boundary conditions on 
the system are as stated in (46) . 


4.3 Switching functions 

The method of finding a function L(^) with 
which to describe the switching criteria for the 
optimum trajectory proceeds as follows. As in 
the discontinuous case, it is desired that 

T 

S = J adt (49) 

0 

* The conditions are stated in terms of the y variable 
for convenience in order that notational problems aris- 
ing from multiple value of eo(0) be avoided. 
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direction of 
positive time 


Figure 3. vs. y.) Phase Plane with Trajectories 
for 5 = ± 1. 

be minimized, therefore, another variable = 
S = C 3 Ya(T) is adjoined to the system and 
once again Ci = C 2 = 0. The hamiltonian be- 
comes 



Figure 4. Concurrent Action of p(t) and y{t) 
for y.(T) = — a^. 


and 

p,(T) =_ac 2 — Mb.>(y.(T)) = — f.y, (54) 

where with modulus such that F(T) = 0. 
In the phase plane of pi vs. p^ it is sufficient to 
note that for trajectories terminating at yi (T) 
= — ai, P 2 (T) ^ 0 and for trajectories ending 
at yi (T) = ai, P 2 (T) ^ 0. This information 
in addition to the control (51) completely de- 
fine L(^) for trajectories ending on the ex- 
tremes of the line segment y 2 (T) | ^ ai. 


H = piy 2 + Piai6 — P 2 yi + — a (50) 

This is maximized in S when 

8 = 1 • sgn (aiPi + P 2 ) (51) 

With this control, trajectories are circular 
about (8, — ai8) with radius determined by y^. 
(See Fig. 3.) 


Previous agruments have determined that 

i = 1,2,3 

(52) 


the conditions on the system are 

y (0) 

= Yi 

yi(T) 

= 0 

|y.(T)i 

^ ai 

H(T) 

= 0 

P3(T) 

z= 1 


The function 


1 




ai) ^ 0 


may be used to describe G. From this 
8F 


b.(y.(T)) = 


8y2 


= yi- 


(53) 


Fig. 4 depicts representative action for opti- 
mum trajectories terminating at y 2 (T) = — ai, 
yi(T) = 0. Trajectories ending at yoCT) = 
+ai, yi(T) = 0 are mirror images. The opti- 
mum switching curves are generated by picking 
arbitrary values of p(T) from the admissible 
set for the corresponding boundary values of 
?/(t) and working backwards in time plotting 
the switching points determined from p{ — ^t) 
on the y, vs. y^ phase plane. 

For trajectories ending in the interior of the 
line segment where | y^tT) | < ai, batyi (T)) 
= 0 and, therefore, PalT) =0. This completes 
the information necessary to describe L(i/). 
Fig. 5 shows a representative trajectory arrived 
at by translating switching criteria from the 
p plane to the y plane. Fig. 6 portrays the curve 
with all dimensions. 

V. THE MINIMUM FUEL PROBLEM 

The minimum fuel problem is solved by mini- 
mizing the integral 
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switching curve 

Figure 5. p, vs. p^, and yj vs. Phase Planes with 
Complete Switching Curves. 


T 

J = |(|ul) + |a,u|)dt (55) 

0 

in the system (20) where T is not specified. It 
appears simpler, however, to once again make 
use of the transformed variable y(t). By mini- 
mizing 

T 

J = f iuldt (56) 

0 

in the transformed system (41), the desired 
result can be obtained provided 

i) the switchings in the time interval 0 ^ 
t ^ T are kept to a minimum. 

ii) adjustment is made at time t = T when 
fuel is consumed zeroing the error states 
e(t) with the exponential control U 2 (T). 

After adjoining (56) to the system (41), the 
hamiltonian becomes : 

H = p,y 2 — p^yi + u(aip, +P 2 ) — |u| (57) 

Since T is not specified H(T) = 0. With u(t) 
constrained as before, the control that maxi- 
mizes H with respect to p(t) is: 

u* = 1 • sgn (aiPi + P 2 ) |aiPi + p-l > 1 

u* = 0 |a,pi + p:,| < 1 

(58) 


5.1 Initial conditions 


Taking the time derivative of H 


dH 

dt 


(aiPi + Pa)-^ — 


d|u 

dt 


( 59 ) 




Figure 7. Optimum Trajectories of c(t) and y{t) 
(minimum time). 
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Figure 8. Optimum Trajectories of e(t) and 3 /(t) 
(minimum time) 


Figure 9. Optimum Trajectories of c(t) and y{t) 
(minimum time). 


A 

V/. J-t LlLiiy 


also be argued that the change in the hamil- 
tonian with time is zero if 


Since u(t) is switching between u = 0 and u — 
±: 1 and vice versa, this means that the 
hamiltonian remains constant if the control is 
switched at aiPi -f p:> = 1 • sgn (au). (See Fig. 
10 .) 


By choosing control u*(t) the hamiltonian 
remains at its maximum value, i.e., identically 
zero from time t = 0+ after initial control has 
been applied until time t = T. This control 
minimizes the integral (56) but does not neces- 
sarily minimize total fuel when fuel consumed 
at switchings is added. In order to minimize 
switchings, it appears necessary to choose the 
degenerate case, i.e., u = 0 until such time as 
aiPi + P 2 = 1 * sgn (a u) where Au is the 
change in u(t) when turning the control on. 
Notice that this choice guarantees that H(t) 
= 0 for all t, 0 ^ t ^ T. With this in mind, the 


problem remains to minimize fuel in the non- 
degenerate case. For this purpose it will be con- 
sidered that time t = 0 is that time when 

aiPi + P 2 = 1 • sgn (Au) (61) 

and initial control is applied. 

At t = 0 it may be verified from (61) and be- 
cause H(0) = 0 that 

Piy^ — PL>yi = 0 (62) 



Figure 10. Switching Criteria in p^ vs. p^ Phase Plane. 
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Figure 11. Admissible Points p(T) for y 2 (T) = — Ej. 

5.2 Final boundary conditions 

In order to investigate final value boundary 
conditions, the optimum trajectories terminat- 
ing such that yi(T — At) > 0 are considered. 
Trajectories in the rest of the space are mirror 
images. As in the minimum time problem, an 
optimum trajectory terminating at y 2 (T) = 
— a,, yi (T) = 0 is investigated first. The deter- 
mination that PalT) ^ 0 as argued in (54) is 
still valid. This condition on P 2 (T) along with 
the fact that H(T) =0 precludes the possibility 
of a trajectory terminating as above with u(T) 
= — 1. The following cases, however, do apply. 
Consider 

H(T) = — a,p.(T) -hu(T) [a,p, (T) 

+P.(T)] — iu(T)! =0 (63) 



Figure 11a. Switching Criteria for y 2 (T) = — a, 
(minimum fuel). 


This condition implies that if u(T) = 0 then 
Pt(T) = 0 and if u(T) = -|- 1 then p(T) ^ 0 
and P 2 (T) = -|-1. Fig. 11 portrays the locus of 
admissible points p(T) and the switching 
curves generated by these criteria in the yi vs. 
y 2 phase plane are as in Fig. 11a. 

Optimum trajectories terminating on the line 
segment yi(T) =0, | y 2 (T) | < ai must be in- 
vestigated in a fashion similar to that used with 
the minimum time problem. Since a final 
boundary point y 2 (T) is not fixed, we may sub- 
stitute a final condition on P 2 (T) to reach a 
solution. At this point it becomes necessary to 
decide on the final value functional to be mini- 
mized. It is first noted that if the final control 
to the line segment is u(T) = 0, then — a, < y 2 
(T) < 0. (It must be remembered that investi- 
gation is of trajectories such that yi(T — At) 
>0). If u(T) = 0 then also y 2 (T) = OilT) and 
in order to minimize the fuel consumed by U 2 
(T) to zero e-^ after time T then | e 2 (T)| = | y 2 
(T) I must be minimized. 

If the final control is u(T) = — 1 (u(T) = 
+1 is not possible for trajectories terminating 
on this side of the line segment) then e 2 (T) 
y 2 (T) — a, and, therefore, |y 2 (T) — a,| must be 
minimized. In both of the above cases, it may 
be seen that y 2 (T) must be maximized on the 
line segment in order that fuel consumed by U 2 
(T) to zero the error states be minimized. 
Therefore, the functional to be minimized is 

3 

S= ^ c,y,(T) = — y 2 (T) -Ky 2 (T) (64) 


y:i(t) = J |u|dt (65) 

0 

By prior arguments P 2 (T) = — C 2 = -|-1 and 
P;,(T) = p;,(t) — — C:, = —1. 

5.3 Generating the switching curve segments 
It is now helpful to look at the hamiltonian 
under each of the above conditions, i.e., u(T) 
= 0 and u (T) = — 1. In the first case 
u(T) - 0 
H(T) = H(t) = 0 
y.(T) = 0 (66) 

P2(T) = + 1 
—a, < y 2 (T) < 0 
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and 

H(T) p,(T)y,(T) =0 

which implies that Pi{T) = 0. Fig, 12 shows 
the switching generated by this condition. 




Figure 13a. Switching Criteria where u(T) = — 1. 


Figure 12. Switching Criteria for u(T) =0 
(minimum fuel). 

Next is considered the case where 
u(T) = —1 

H(T) = H(t) = 0 (67) 

y,(T) = 0 
P.(T) == +1 
and 

H (T) = p, (T) y, (T) - 1 [a,p, (T) + 

1] —I = 0 

from which 

Since u(T) = — 1 and p»(T) = +1, conditions 
(58) are met only when 

p,(T) 

which implies y 2 (T) > 0. In Fig. 13 these tra- 
jectories and switching curves are plotted. 


5.4 The complete stvitching curve 

Because T was never .specified and because 
the fuel consumed at switching was handled as 
a side condition, a composite of all the cal- 
culated switching curves iindicates areas in the 
phase plane where (ffiteiiiatfor optimum control 
appear contradidtca^r.. In tfihese areas, analysis 
by graphical means or actual computation will 
clear up the situation. Fig. 14 depicts the com- 
po^te (af file first two criteria analyzed. 



Figure 14. Region of Conflicting Optimum Criteria. 



In Fig. 14, region A is an area where there 
is a question concerning whether it is optimum 
to switch for j y^IT)] = ai or [y-lT) | < a,. B> 
graphical analysis, it may be seen that it is 
optimum to switch so that | y^lT): = a,. 

A similar contradiction between trajectories 
switching for 0 < y^IT) < a, and — a, < y^IT) 
< 0 may also be resolved graphically.* The final 

* Appendix I presents computational analysis of the 
resolving process. 



192 DISCONTINUOUS SYSTEM VARIABLES IN THE OPTIMUM CONTROL OF SECOND ORDER 


result consisting of switching criteria to zero 
the errors in the system (20) with minimum 
fuel is given "by Fig. 15. 



Figure 15, Switching Criteria for Minimum Fuel, 
a, = 1.0. 

VI CONCLUSIONS 

The methods used in this paper to arrive at 
a solution may be used to good advantage in 
the investigation of any n^*' order system with 
no more than n — 1 zeros. The maximum 
principle provides a powerful tool in optimiza- 
tion, particularly for linear systems. Often the 
method of Pontryagin will indicate areas of 
interest to investigate when searching for an 
optimum control even if the unique solution is 
not readily forthcoming. 



Figure 16. Block Diagram of ths Controlled Plant. 


The problem of controlling a plant with zeros 
is analagous to controlling a plant without zeros 
using an impulse-step type controller. Results 
obtained in this paper can be adapted to formu- 
late the logic of this type control. 

The realization of the true optimum switch- 
ing logic in a practical system may in many 
cases not be worth the effort. Quasi-optimum 
control using simple switching functions that 
are for the most part linear is a subject for 
further investigation. Setting time for the sys- 
tem is relatively insensitive to limited varia- 
tions from the optimum when trajectories are 
out beyond the first cusp of the switching curve. 

APPENDIX I 

In the past, the electrical engineer, has usual- 
ly turned to the analog computer for problem 
solving and his reference of familiarity is 
strongest there. To test the drift of the ampli- 



Figure A-1. Trajectories showing the effects of differ- 
ent integration step sizes (A) on the Runge-Kutta 
method. Period of unforced system being integrated 
was T = 6.2836. 
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fiers, a common method was to plug in a simple 
oscillatory system 



and to study the decay of the circular trajectory 
in the phase plane (y vs. y). 

This same procedure was used here to study 
the different integration schemes used and to 
determine the proper integration step size. The 
forcing function (u — ±1) was introduced to 
give breaks in slopes and also discontinuities in 
the steady state limit cycle described by 



Figure A-2. Trajectories showing the effects of differ- 
ent integration step sizes (A) on the Runge-Kutta 
method. Period of unforced system being integrated 
was T = 6.2836. 


This graphical means of analyzing gave con- 
siderable insight in evaluating the integration 
schemes. The following figures give a brief re- 
sume of these studies. 

The 4*’’ order Runge-Kutta method was found 
to be the most foolproof when dealing with dis- 
continuities in the state variables. In the con- 
tinuous variables the Adams Bashforth predic- 
tor-corrector^ with a Taylor expansion or 
Runge-Kutta starter were found efficient and 
useful. 

The system was also tested with the plant on 
the analog computer and the control with its 
nonlinear switching function being generated 
on an on-line digital computer. 

BIBLIOGRAPHY 

1. Rozonoer, L. I., “L. S. Pontryagin’s Maxi- 
mum Principle in the Theory of Optimum 
Systems — Part I,” Avtomatika i Telemek- 
hanika, Vol. 20, pp. 1288-1302, October 
1959. 

Rozonoer, L. I., “L. S. Pontryagin’s Maxi- 
mum Principle in Optimal System Theory 
— Part II,” Avtomatika i Telemekhanika, 
Vol. 20, pp. 1405-1421, November 1959. 

2. Flugge-Lotz, I., and Titus, H. a., “Opti- 
mum and Quasi-optimum control of third 
and fourth-order systems,” Division of 
Engineering Mechanics, Stanford Univer- 
sity Technical Report No. 134, pp. 8-12, 
October 1962. 

3. Flugge-Lotz, I., and Ishikawa, T., “In- 
vestigation of Third Order Contactor Con- 
trol Systems with Zeros in Their Transfer 
Functions,” NASA TN D-719, January 
1961. 

4 . Hamming, R. W., “Numerical Methods for 
Scientists and Engineers,” McGraw-Hill 
Book Company, Inc., New York, 1962. 




TWO NEW DIRECT MINIMUM SEARCH PROCEDURES 
FOR FUNCTIONS OF SEVERAL VARIABLES 

Bruno F, W. Witte Design Specialist 
and 

William R. Holst Senior Research Engineer 
General Dynamics/ Astronautics 
San Diego 12, California 


(1) LINEAR SEARCH METHOD FOR 
n VARIABLES 

(1.1) Summary Of The Method 

The method can somewhat vaguely be classi- 
fied as a modified ‘'steepest descent/’ It is, of 
course, an iterative procedure. Each cycle, to 
be iterated on, consists essentially of two parts : 
in Part I a “best” line is found, in Part II an 
attempt is made to minimize the given function 
along this line. Thus, each cycle resembles the 
corresponding cycle in the method of steepest 
descent. The method of steepest descent differs 
from our method in the manner in which in 
Part I the “best” line is found. Steepest descent, 
in fact, implies that this line (let us call it the 
“baseline” from now on) be defined by the 
starting point for the cycle and the gradient of 
the function at this starting point, where the 
starting point, in turn, is the minimum point 
of the preceding cycle. Well known modifica- 
tions of the steepest descent are concerned with, 
for example, baselines restricted to a subspace 
normal to the preceding baseline, or with differ- 
ent ways of minimizing along a given baseline, 
or perhaps with the question as to how feasible 
it is to seek a minimum at all along a given 
baseline during each cycle before switching to 
the next baseline. 

(1.2) Symbol Explanations 

y — The ordinate of arbitrary points in the 


space i.e. that coordinate which is 
plotted in the same direction as is the 
function value. 

F — The function to be minimized. 

n — Number of independent variables in 
F = F(x, . . .Xn). 

S — Designates the hyper-surface defined 
in 4> by F. 

$ — Designates the space spanned by the n 
independent variables Xj . . . Xn, the 
abscissas, and by y, the ordinate. 

— The space spanned by the n independ- 
ent variables Xi . . . x„. 

X; — The abscissas, i.e. the independent 
variables, i = 1 . . . n. 

X — The point with coordinates Xi, i.e. x — 

(Xi . . .Xn). 

x^ — The starting point in </> for minimizing 
F along b^ in cycle C^. Also the mini- 
mum point found by minimizing F 
along b^— 1 in cycle x^; . . . x^,), 

k = 0, 1, 2, . . . . 

— Designates the k-th cycle of the itera- 
tive procedure leading to the minimum 
of F. k = 0, 1, 2, 3, 

F^ — An abbreviation for F (x^) . 

— The starting point in ^ corresponding 
to x^, i.e. S*" = (x^ F^) , k = 0, 1, 2, 

Tk — n-dimensional hyper-plane tangent 
on S in S*". 
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— The baseline chosen in cycle 
grad — An n-vector in the space (f> pointing in 
the direction of steepest acscent of any 
subspace of ^ or manifold of ^ with 
dimension 1, 2, . . . , or n. The magni- 
tude of this vector is equal to the slope 
of steepest ascent. For linear subspaces 
this vector is independent of x. For the 
subspace S this vector is the familiar 
gradient of F. 

A — An operator designating the intersec- 
tion of the two subspaces between 
which it stands. 

— A subspace of 4> with dimension n — i 
defined by P* = T® A T^ A . . . A T^ 

Q* — A straight line in $ defined by Q' = 
Ti-n+i ^ Ti-n +2 A . . . A Ts (i>n — 1). 

pi — The intersection P* A c/). 

q' — The point Q' A <f>. 

U — The point on Q* with y = 1. 

^ — The projection of L' on 
— The equation of a hyper-plane parallel 
to T**' through the origin of <t>. 

Aj — Lagrangian undetermined multipliers. 

1 The running coordi- 
z=z > nates of an arbi- 

) trary point on T*". 

aV — Defined by a^ = — ^F/5Xi, evaluated at 

s — The distance from x*" to a point on b*". 
s is positive in the downward direc- 
tion. 

G — The function F considered as a func- 
tion of only s, i.e. G (s) = F (xi (s) , . . , 
x„(s)). 

G' — The derivative dG/ds. 

(1.3) The Baseline Choice 
The method used to choose the next baseline 
can best be visualized as follows : The function 
y =; F(xi ... Xji) defines an n-dimensional 
hyper-surface, S, embedded in the (n + 1) — 
space, spanned by the n-spaced, <^, of the 
independent variables Xi . . . Xn and the depend- 
ent variable y. Let x"" = (x? . . . xX) be the start- 
ing point of the first cycle, of our search 
procedure, and let F‘' — F(x‘0 and x'^ define the 
corresponding starting point = (x^ F‘0 on 


the hyper-surface.* Let T” designate the hyper- 
plane tangent in S‘* on S (whenever necessary 
we will assume that such tangent hyperplanes 
exist). The baseline, b", for the first cycle, C'‘, 
is then chosen to be the line through x" and with 
direction defined by the gradient of T'*, grad T'‘. 
Naturally, we have for this cycle grad T‘* =: 
grad F(x'’). We will now assume for the mo- 
ment that we know how to proceed with part II, 
and that at the end of part II we obtain an ap- 
proximation x^ = (xi . . . xi) for the minimum 
of S along b‘*. x^ is also the starting point of 
cycle (In general, x’ designates the starting 
point and s‘+’ the minimum point of cycle C*; 
S‘ = (x\ F*) is the point on the hyper-surface 
corresponding to x^ and T' is the hyperplane 
tangent in S’ on S ; finally b is the baseline for 
cycle CO- The next baseline b^ is then chosen 
to be the line through and with direction de- 
fined by grad (T*' A TO, where T® A T^ is the 
intersection of 'T' and T*, i.e. an (n — 1) -dimen- 
sional linear subspace of <i>. Assuming again 
that we know how to find x-, the minimum point 
along b- is found as the line through x- in a 
direction given by grad (T'' A T^ A T“0 . 

In general, for i<n — 1, b’ is determined so 
that it becomes the baseline through x’ with a 
direction given by grad (T'’ A T^ A ... A TO- 
The argument of the gradient, P’ = T** A T^ A 
T“ A ... A T\ is a linear subspace of and has 
dimension n — i. For i = n — 1, P’ would then be 
a straight line; for i = n, it would be only one 
point, for i>n, P’ is an empty set; for i^n, 
therefore, grad P’ is not defined. For 0^ 
i<n — 1, grad P’ is defined, of course, as the 
direction in <t> along which P’ ascends most 
rapidly. Since all P’ are linear, grad P’ is inde- 
pendent of X. 

For i>n — 1, b’ is determined so that it be- 
comes the baseline through x’ with a direction 
given by grad Q\ where Q’ = T’— aT’“"+- A 
. . . A T\ Thus, all Q’ are straight lines in and 
their gradients are parallel to their normal pro- 
jections on <^. 

attempt is made in the above and subsequent 
notation to designate by lower-case letters all points, 
or point sets, or subspaces, which are completely em- 
bedded in the n-space of the independent variables 
. x,^, and to use capital letters if they are not com- 
pletely contained in this n-space. 
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In general, for all i, the above de- 
fined gradients are determined as follows : 
Let p' and q' be defined by p'=:P'A$ (for 
0^i<n — 1), q' = QLV<#> (for n — l^i). Then 
all q* are single points. Next consider the point 
L’ (i^n — 1) with ordinate y = l on one of the 
lines Qs and its projection on </>. The directed 
line from q' to is parallel to grad Qs and 
' grad Q' | = 1/ i q^ — |. The n coordinates of 
p — (/I . . . /,\) are found by solving simul- 
taneously the n linear equations, E^, corre- 
sponding to the n T*" in the expression for Q' 
and setting y = 1. Setting y ~ 1 is arbitrary, 
and insures only that y ^ 0. The n coordinates 
of q' = (qi . . . q,0 are found by solving the same 
system with y = 0. Next consider some arbi- 
trary point L‘ (0^i<n — 1) with ordinate 
y = 1 on one of the linear subspaces P' of 
dimension n — i, and its projection on <^. Let 
q‘ (for O^i < n — 1) be the point on p' closest to 
The directed line from q' to is then, again, 
parallel to grad and also | grad P' | = 
1/ ' q' — . Since L' can be chosen arbitrarily 

from all the points on P' with y = 1, we set 
(1 = 0 (for k = i + 2, . . . , n), and find the first 
i + I coordinates of = {{ . . . /iVi, /i+o . . . (1) 
by solving simultaneously the i + 1 linear equa- 
tions, E*', corresponding to the i + 1 T*" in the 
expression for F'\ again also setting y = 1. The 
n coordinates of q> = (qi' , . . q,;) are determined 
by requiring that the square of the distance 
from to q’ be a minimum, i.e. d- = ( — q])- 
+ ( /l! — q!:)" + • • • + ( /n — qn)“ = min. 
Furthermore, we note that the qjl must satisfy 
the constraint that q' is to be on p\ which is the 
same as saying that the qk must also satisfy the 
i + 1 linear equations, E^" (k = 0 . . . i), corre- 
sponding to the i 4" 1 T*' in the expression for 
PS but now setting y = 0. The Lagrangian 
method of undetermined multipliers leads us 
then to the unconstrained minimization of the 
function g(qi ... qi, Ao . . . Ai) = d- -f A,,E« + 
. . _|_ XiE\ Equating to zero the partial deriva- 
tives of g with respect to the n qi and the i -f 1 
Aj, we obtain again a system of linear algebraic 
equations, here of order n + i + 1. Note that 
this way of finding q* applies only to the initial 
cycles of the search for which i < n — 1 ; hence 
the largest linear system to be solved here will 
have order n + (n — 2) -f- 1 = 2n — 1. 


The linear equations E^ (O^k) correspond to 
our hyperplanes T^ which are tangent to the 
hyper-surface S at the locations x^. The ana- 
lytic expression for one of the E^ can readily be 
obtained from a Taylor expansion of F about 
x*": F(x) z=F(x^) -f (x — x^) • gradF(x^) + 
higher-order terms. If we replace F(x) by and 
X — (xi . . . Xn) by . II), delete the 

higher-order terms, collect all constant terms, 
and rewrite this expansion in terms of the indi- 
vidual running coordinates on the tangent 
plane, we obtain with grad F (x^) = — (a| . . . 
a^„) : a?; + a?, ^1 + . . . + aS = b*. This 

can be simplified considerably for computa- 
tional purposes by always setting b^ = 0. This 
is permissible because we are only interested in 
the slopes of the T^, or in the slopes of subspaces 
common to several T’^. The E^ are then ^- 
plicitly : E^ = a| + . . . aJ = 0, and 

they describe hyperplanes which are parallel to 
the tangent planes, and contain the origin of 
the space <l>. The a| could be evaluated directly 
at x^ as the partial derivatives of F with respect 
to Xj. Or they may be approximated by finite 
difference approximations to these derivatives. 
The computer program referred to in the ab- 
stract uses first-order difference approxima- 
tions. 

(1.4) Minimizing along the Baseline 
The search for the baselines was explained in 
the preceding section. In this section we de- 
scribe what to do with a baseline once it is se- 
lected, i.e, how to minimize the given function 
F (xi . . . x„) along this line. Let us introduce an 
(s, G) -coordinate system with origin at x^, with 
the positive s-axis in the downward direction of 
the baseline, and with the positive G-axis in the 
positive y-direction. The function G(s) is de- 
fined as G(s) = F(xi(s) . . . x„(s)). We re- 
state now our problem more precisely: given 
So =: 0, Go = G(So), Go = G'(So), and Si (the- 
initial step size), find a ‘'bracketed'' approxima- 
tion, s„i,. to a relative minimum of G(s), and 
verify the existence of two bracketing numbers 
Sa and Sb, such that s., < s„, < s,, and G(Sa) > 
G(s,n) < G(Sb). The procedure is described be- 
low in several steps. 

STEP (1) 

Si is equated to "Si, and G =: G(si) is evalu- 
ated. This defines the point Pi = (si, Gi) for 
i = 1, and with G] = G(Si). 
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Question: Is Gi ^ Go ? 

If yes, a parabola is passed through P„ with 
the given slope Go, and through Pi. This is 
done in the following Step (2) . 

If no, we go directly to Step (3) to check 
whether Pi is above or below the line through 
Po with slope Go. 

STEP (2) 

The minimum location S 2 is found for the 
parabola through Po and Pi with slope Go at 
Po. So will satisfy: |So| < |s 2 | — |si|/2. G 2 = 
G(s 2 ) is evaluated. 

Question: Is G 2 < Go ? 

If yes, we have solved our problem, i.e. we 
have obtained a bracketed approximation to 
the minimum, with Sa = So, Sn, = S 2 , Sb = Si. 
However, our experience showed that the ex- 
penditure of one more function evaluation 
often allowed a considerable improvement of 
the value for Sm. This is particularly desirable 
toward the end of our search for the minimum 
of F(xi . . . x„). The improvement of Sm is 
obtained by cubic interpolation, and is de- 
scribed in Step (4) . Before going there, how- 
ever, the indices 1 and 2 are interchanged on 
the quantities Pi, Gi, Si and P 2 , G 2 , S 2 so as to 
number the points P<„ Pi, P 2 in the order of 
increasing |s| -values. 

If no, we substitute P 2 for Pi, and repeat Step 
(2) until either G 2 < Go, or |S 2 | < |si|/100. 
In the latter case we replace ^1 by — Si, Go by 
the slope of the line (Po, P^), and return to 
Step (1). This starts a search for the mini- 
mum of G in the range of negative s-values 
(assuming that Go was inaccurate). In case 
our search should then again lead us back to 
the vicinity of the same s-value, s — So, we set 
s,o = So with brackets S;, = — |S] |/100 and Sb = 
I'Sil/lOO. In this case our starting point and 
our approximation to the minimum are one 
and the same point. That situation will al- 
ways happen at the final minimum of F (xi . . . 
Xo), but may also happen occasionally sooner. 
The termination procedure, described further 
below, then is triggered. 

STEP (S) 

Question: Is Pi above the line through Po 
with slope Go ? 


If yes, the minimum location So is found for 
the parabola through Po and Pi with slope Go 

at Po. It satisfies |S 2 | > |si|/2. If |s 2 | < !si|, 
the indices 1 and 2 are interchanged. 

If no, the minimum location So is found for the 
cubic with elope G„ at Po and with its point of 

inflection at Pi. It satisfies |s 2 | > 2|si|. 
Question: Is Gi < Go = G(s 2 ) ? 

If yes, we found a bracketed approximation to 
Smin, and proceed with Step (4). 

If no, we re-define Go to be the slope of the 
parabola through Po, Pi, P 2 at point P], then 
subtract 1 from all index values occurring in 
Pi = (Si, Gi) (i = 0, 1, 2), and return to Step 
(3). 

STEP a) 

A cubic is passed through Po with slope Go, 
and through Pi and P 2 . Sm is the location of 
the minimum of this cubic. Note that the 
existence of brackets for s,„ guarantees the 
existence of the minimum. 

STEP (5) 

We test next the three lowest values of G, 
found so far, whether they agree with each 
other within a given tolerance. 

If yes we set Sm equal to that Srvalue for 
which G(Si) is smallest. (s,n usually is, but 
need not be, that Srvalue.) 

If no, a parabola is passed through the lowest 
three points, and the minimum value of this 
parabola is compared with the two lowest 
available values of G. If these three values 
agree with each other within the given toler- 
ance we have found Sm ; if they do not agree, 
we evaluate G for that s-value for which the 
parabola had its minmium, and go to Step 
( 6 ). 

STEP (6) 

We test the two lowest values of G with the 
minimum value of the last parabola. If these 
three values agree with each other within the 
given tolerance we have found Sm ; if they do 
not agree, a parabola is passed through the 
lowest three points of G and we compare the 
minimum values of the last tioo parabolas 
with the lowest value of G. 

If these three values agree with each other 
within the given tolerance, we have found Sm ; 
if they do not agree, we evaluate G for that 
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s-value for which the last parabola had its 
minimum, and repeat Step (6). 

(1.5) Detection of the Minimum 
The iterated execution of the procedures for 
Parts I and II of each cycle may be discontinued 
whenever one of the following conditions is 
detected : 

Condition A: The partial derivatives of F = 
. . . x\) satisfy \dF/dx)\ ^ e for all i = . . . n, 
and for a given small value of e, say € — 10-®. 
As explained at the end of section (1.3, these 
partial derivatives are evaluated to obtain the 
coefficients a^ in the equations E^. 

Condition B: The starting point and the ap- 
proximation to the minimum point on the base- 
line are one and the same point. How this can 
happen was explained in Step (2) of section 
(1.4). 

Condition C: The matrix of the linear equa- 
tions E^, discussed in section (1.3), is very ill- 
conditioned or singular. This may occasionally 
happen before either Condition (A) or Condi- 
tion (B) is satisfied. 

Condition D: The numerical values of the co- 
ordinates of the minimum location and/or the 
minimum of the function have stabilized to a 
given num.ber of significant figures. 

Whenever one, or perhaps two of the above 
conditions are satisfied, there is an excellent 
chance that the minimum has been found. One 
may then do one of two things: (i) terminate 
the execution of the program, or (ii) begin a 
somewhate more extensive testing procedure 
in the course of which the vicinity of the sus- 
pected minimum location is explored, and it is 
thereby made even more likely that the solution 
has been found. This is what was programmed 
at G.D./Astronautics. Preference was given 
to method (ii) because it also provides an an- 
swer to the question how sensitive the function 
is to small deviations from the minimum loca- 
tion. The knowledge of this sensitivity is usu- 
ally desired together with the minimum location 
itself. 

(2) CIRCULAR ARC SEARCH METHOD 
FOR 2 VARIABLES 

(2.1) Summary of the Method 
The iterative circular search is preceded by a 
starting procedure, which finds five points near 


the spine of the valley of the function to be 
minimized. Thereafter, each iteration begins 
with an attempt to approximate by a portion of 
a circle the course of the projection of the valley 
onto the space of the independent variables. 
The first circle is fit to three of the five valley 
points found. All succeeding circles are fit to 
four or more valley points. As the iteration 
progresses toward the minimum these circles 
approach the osculating circle at the minimum 
location. During each iteration the determina- 
tion of a circular arc approximation to the spine 
of the valley is followed by an attempt to pre- 
dict a point on this arc which is closer to the 
desired minimum than any other point obtained 
so far. Again during each iteration, the predic- 
tion of such a point is followed by a steepest 
descent from this point into the valley, which 
amounts to a minor, yet important correction. 
The correction is important since it insures that 
deviations between the courses of the circular 
arcs and the portions of the spine of the valley 
approximated by the arcs will not accumulate 
as the iteration progresses. The new valley 
point then replaces the highest of the four low- 
est valley points found before, and this predic- 
tor-corrector type method is repeated in the 
next iteration. Empirical results show that the 
sequence of valley points obtained converges 
quite rapidly to the desired minimum. In the 
following sections we emphasize again the geo- 
metric aspects of the more important steps 
summarized above, rather than to give the alge- 
braic details. This is done since some of the 
details are still subject to experimentation and 
analysis. 

(2.2) The Starting Procedure 
We assume that a first guess is available as 
a starting location x\ We find its valley point 
v^ A valley point v' will always mean the loca- 
tion of the minimum of the given function F 
along a line through x’ in the direction of 
— grad F(xO. We find the distance d^ = 
|vi — x^l and a unit vector e^ in the direction of 
the projection of — grad F (vU on a line normal 
to — grad F (xD . We then find four more valley 
points vMi == 2 . . . 5) by taking a step d‘ =1 
(3/4)*— 2di in the direction of from v*-^ to 
x\ and then by minimizing from x* to v^ The 
directional minimizations are done by using the 
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same method which was described in section 

(1.4). 

(2.3) The Approximating Circular Arcs 

The circular arcs are obtained as arcs of ‘Test 
fitting’" circles to the locations v‘ of the lowest 
three, four, or five valley points. Three valley 
points are chosen only once, i.e. for the first 
circle ; four are usually chosen thereafter ; if it 
happened, however, that the last valley point 
found had a function value higher than the 
highest of the four lowest valley points previ- 
ously known, then five points were chosen. 

(2.4) The Predictor-Corrector Step 

A “best fitting” two-dimensional plane E;;* is 
found for the valley points V* = (v‘, FO in space 
<I>, where v‘ and i were explained above, and 
F' = F(v0. It is helpful to visualize now a 
cylinder in <l> parallel to the y-axis and intersect- 
ing <!> along the circle of the preceding section. 
This cylinder and the above plane intersect 
in an ellipse. The location u of the point on this 
ellipse with the smallest y-value is our pre- 
dicted point. The distance of the point u from 
the spine of the valley decreases rapidly as the 
iteration progresses, and appears to become 
insignificantly small long before the minimum 
is reached. Nevertheless, it is unknown at the 
present time how damaging the cumulative 
effect woud be. We are employing, therefore, 
during all iterations a straight steepest descent 
correction which leads from u to the next valley 
point. The correction is done in the same way 
as was explained for the five starting points in 
(2.2), i.e. by using the method described in 

(1.4) . The initial step taken in minimizing 
from the low point on the circle to the valley is 
set equal to one-half the length of the gradient 
vector at the low point divided by the coefficient 
of the second-degree term of the parabola ap- 
proximating the shape of the function along 
the previous line of minimization. We expect 
that further studies of this particular correc- 
tion will lead to considerable savings in the 
number of function evaluations. For example, 
it appears that the correction usually need be 
made only in a radial direction rather than the 
negative gradient direction, while the latter is 
needed only occasionally. Moreover, it is doubt- 
ful whether any such correction is needed at all 
in each iteration. 


(2.5) The Switch to Parabolas 

The above predictor-corrector step is taken 
at least four times. If thereafter the function 
value of the lowest valley point is within Y 
of the function value of the second-lowest valley 
point, the method of predicting a new point u on 
the circle is modified somewhat. Instead of find- 
ing the above described plane E^, we find the 
“best-fitting” line L to the same four points v^ 
defining our circular arc and in the same plane 
with this circular arc. The lowest three v' are 
then projected on L to give the three points w’ 
on L. These and the F‘ — values associated 
with the v^ define three points W‘ = (w\ F^ in 
the plane through L and parallel to the y-axis. 
The location w“‘ on L of the minimum of the 
parabola through the three points W‘ is then 
projected back onto our circular arc to give our 
predicted point u. The steepest descent correc- 
tion is gain applied to u, and produces the next 
valley point. 

(2.6) The Tail Correction 

After successive approximations to the mini- 
mum value of the function have stabilized to 
five figures, the search terminates with a simple 
tail correction, which consists in minimizing 
the function along the line through the two low- 
est valley points, and which usually adds sev- 
eral significant figures to the answers. 

It is interesting to observe the behavior of 
the radii of the circular arcs after the iteration 
has begun to converge. As was said before in 
Section (2.1) the circles approach the osculat- 
ing circle of the spine of the valley at the mini- 
mum location. However, once the sequence of 
valley points has converged to the minimum, all 
subsequent points fall very close to each other ; 
they then no longer define the course of the 
valley nor do they define its osculating circle. 
Instead, the radii of the “best-fitting” circles 
suddenly shrink by several orders or magnitude 
to dimensions comparable to the inaccuracies in 
the coordinates of the minimum location. When 
this shrinkage occurs the search should be ter- 
minated. Usually, however, this does not hap- 
pen before the search had already been termi- 
nated with the above described tail correction. 

(2.7) An Extension to n Variables: 

The Spherical Shells Method 

The good results obtained with the two- 
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dimensional Circular Arcs Method suggest its 
extension to n independent variables as follows : 

( — 1) ST ART. A starting point x‘ is given in 
n space as a first approximation. Find its valley 
point vS the distance d^ = |v^ — x^l, the n — 1 
space normal to grad F(x0, and the direc- 
tion e^ in E’ as the projection of — grad F(v0 
on EL Find n + 3 more initial valley points v' 
(i ~2 ... n + 4) by taking steps d’ = (n + 1)/ 
(n -|- from v‘“i in the direction e*~^ 

to x‘, and then by minimizing F in the direction 
of — grad F(x0 from x' to v‘, for all values of 
i = 2 . . . n + 4. 

( — 2) FLATS. Fit an m flat (i.e. a linear m 
space) to the lowest m + 2 valley points in n 
space, such that its dimension m is as small as 
feasible. (The feasibility could be judged by 
observing successive standard deviations a„, of 
the points from the m flats for the sequence of 
decreasing m — values : m = n, n — 1, . . . , 1. 
A sudden large increase of ^k-i over ak would 
indicate that m = k is feasible while m = k — 1 
is not feasible.) Skip to ( — 4) if m = 1. 

( — 3) SPHERES. Fit the surface of an m- 
dimensional sphere to the projections of the low- 
est m + 2 valley points on the above m flat. 


Some shell of this sphere will approximate the 
course of the (hyper — ) valley. Pin-point next 
an optimum surface point on this shell to which 
to go next, find its valley point, and return to 

(- 2 ). 

( — 4) PARABOLAS. Find the parabola 
through the projections of the lowest three val- 
ley points on the plane spanned by the function 
axis and the regression line found in ( — 2). Go 
to the minimum of this parabola, find its valley 
point, and return to ( — 2) unless the mTnim-um_ 
value of the function has begun to stabilize. 

( — 5) TAIL. Minimize the function along a 
line through the lowest two valley points in n 
space. Find the valley point corresponding to 
this directional minimum. Repeat ( — 5) till the 
function minimum has stabilized to the desired 
number of figures. 

The above described Spherical Shells Method 
will be programmed at General Dynamics- 
Astronautics in the near future, and results will 
be published when available. 

(3) EXAMPLES AND COMPARISONS 

The following functions of x and y were mini- 
mized. (r = (x2 -|- y2)Va and ^ — tan— ^ (y/x).) 


(3.1) 

ROSIE 

= 100 (y — X-)- 

+ 

(1 — x)^ 

(3.2) 

SHALOW 

= (y — x^')"’ 

+ 

(1 — x)^ 

(3.3) 

STRAIT 

= (y — x=^)^’ 

+ 

100(1 — x)" 

(3.4) 

CUBE 

= 100 (y — X*)- 

+ 

(1 — x)^ 

(3.5) 

DOUBLE 

= 100(r — l)-(r- 

-2)^ + 

0.1 ( 2— x)- 

(3.6) 

HEART 

= 1000 ( 1 + cos 9 — 

r/10)" + y"(x— 10)^ 


All were programmed in double-precision for 
an I.B.M. 7094 computer. Results are listed in 
the following tables, and the results for ROSIE 
are discussed in detail and compared with other 
methods. 

(3.1) ROSIE 

Minimum (F = 0) at (1, 1), with a steep 
valley along y = x-, and a side valley along the 
negative y-axis. 

Table (3.1A) summarizes the end results of 
various investigators. Table (3. IB) summarizes 
the rapidity of convergence of some of these 
methods. Table (3. 1C) shows the rate of con- 
vergence of the circular arcs method for various 


starting points. Table (3.1D) gives minimiza- 
tion details for the circular arcs method. 

Table (S.lA) End results of various methods of 
minimizing the function ROSIE when starting 
from the point (x, y) = ( — 1.2, 1) 
k = number of valley points found. 

N =: number of function evaluations 

Note : The value of the function given by ref. 
(5) in the table is not consistent with the values 
given for x and y. The correct value of F equals 
1 ( — 6) for (x, y) = (1.0001, 1.0001) . 

Discussion: Parentheses in column N indicate 
that the number of function evaluations was not 
published in the indicated papers, but instead 
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was estimated by us to correspond to each indi- 
vidual method, assuming an average number of 
8 function evaluations for minimizing the func- 
tion in a given direction. The method of row 5, 
ref. (7), requires the programming of the 
analytic expression of the gradient of F; none 


of the other methods do. The numbers in paren- 
theses in the (min F) column give exponents of 
the powers-of-ten factors. It is obvious that the 
method of ref. (10) is the least efficient, while 
the circular arcs method is the most efficient for 
the function ROSIE. 


Table (3.1A) 


k 

N 

min F 

X 

y 

Method 

17 

146 

6(— 

8) 

0.999957 

0.999938 

“Lin. Search”, this paper, 1959 

— 

200 

2(— 

5) 

0.995 

0.991 

Rosenbrock, ref. (4), 1960 

33 

(264) 

8(— 

9) 

1.0001 

1.0001 

Powell, ref. (5), 1962 

85 

(765) 

6(— 

5) 

1.000853 

1.002077 

Baer, ref. (10), 1962 

18 

(144) 

K— 

8) 

? 

? 

Flethcer, ref. (7), 1963 

12 

103 

3(— 

9) 

1.000008 

1.000010 

“Cir. Arcs”, this paper, 1964 


Table (3.1B) 


k 

0 

3 

6 

9 

12 

15 

18 

21 

24 

27 

30 

33 


References : 

(E) 

Ref. (5) 

(G) 

Ref. (7) 

“Linear 

Search” 

“Circ. 

Arcs” 

k 

2.4 -t-1 

2.4 -fl 

2.4-)-l 

2.4-1- 1 

0 

3.6 0 

3.7 0 

3.5 0 

3.5 0 

3 

2.9 0 

1.6 0 

1.9 0 

7.3—2 

6 

2.2 0 

7.5—1 

8.3—1 

8.9—3 

9 

1.4 0 

2.0—1 

3.4—1 

3.4—9 

12 

8.3—1 

1.2—2 

1.1—1 

— 

15 

4.3—1 

1 —8 

6.0—8 

— 

18 

1.8—1 

— 

— 

— 

21 

5.2—2 

— 

— 

— 

24 

4 —3 

— 

— 

— 

27 

5 —5 

— 

— 

— 

30 

8 —9 

— 

— 

— 

33 


Table (3.1B) Rate of minimization of function 
ROSIE for four different methods, all starting 
from ( — 1.2, 1) . All function values are in the 
powers-of-ten notation, e.g. 2.4 -f 1 = 24 or 
3.4 — 9 =: 0.0000000034. 

k = number of valley points found 

The table clearly shows the better rate of con- 


vergence of the circular arcs method for the 
function ROSIE. 

Table (3. 1C) Rate of minimization of function 
ROSIE with the circular arcs method for five 
different starting points (x, y). Bottom row 
gives the total number, N, of function evalua- 
tions. The location of the minimum is at x = 1 
and y 1. 
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Table (3.1C) 


X 

—1.200 

5.621 

—0.221 

—2.547 

—2.000 

X 

y 

1.000 

—3.635 

0.639 

1.489 

—2.000 

y 

k 






k 

0 

2.4+1 

1.2+5 

3.6+1 

2.5+3 

3.6+3 

0 

1 

4.1 0 

9.8+2 

2.0 0 

5.4 0 

2.2+2 

1 

2 

3.8 0 

2.6+1 

8.9—1 

3.2 0 

2.6 0 

2 

3 

3.5 0 

1.5 0 

5.5—1 

1.4 0 

1.4 0 

3 

4 

3.3 0 

2.4+2 

2.7—1 

8.6—1 

2.1—2 

4 

5 

3.2 0 

1.1—4 

1.4—1 

2.2—1 

5.2—2 

5 

6 

7.3—2 

7.5—1 

5.8—2 

1.9—1 

1.7—3 

6 

7 

3.3—3 

4.7—1 

3.5—2 

1.8—1 

1.6—3 

7 

8 

8.4—3 

5.3+2 

1.2—2 

7.4—2 

1.4—4 

8 

9 

8.9—3 

4.7—1 

2.6—3 

1.9—2 

7.5—6 

9 

10 

7.0—7 

4.9—1 

4.1—4 

5.1—3 

6.1—8 

10 

11 

4.4—6 

3.2—2 

1.2—5 

2.7—3 

3.7—11 

11 

12 

3.4—9 

4.2—3 

2.6—7 

6.4—5 

— 

12 

13 

— 

2.6—5 

6.6—10 

3.4—6 

— 

13 

14 

— 

1.3—7 

— 

4.3—8 

— 

14 

15 

— 

5.3—10 

— 

2.1—11 

— 

15 

16 

— 

8.9—14 

— 

— 

— 

16 

N 

103 

195 

93 

118 

101 

N 


Table (3.1D) 


k 

N 

F 

t 

X 

y 

R 

grad 

total 








correction 

0 

1 

24.2 


—1.2000 

1.0000 




1 

9 

4.1 

1—2 

—1.0298 

1.0694 


2—1 


2 

17 

3.8 

1—2 

—0.9464 

0.9043 


2—2 


3 

25 

3.5 

1—2 

—0.8800 

0.7829 


2—2 


4 

33 

3.3 

1—2 

—0.8266 

0.6924 


1—2 


5 

41 

3.2 

1—2 

—0.7852 

0.6256 


2—2 


6 

59 

7—2 

1—2 

1.2705 

1.6151 

4.1 

5-0 

2—0 

7 

69 

3—3 

1—2 

0.9427 

0.8885 

1.2 

3—1 

8—1 

8 

75 

8—3 

4—3 

1.0918 

1.1923 

1.3 

5—2 

3—1 

9 

82 

9—3 

4—3 

1.0940 

1.1972 

1.4 

3—2 

3—1 

10 

89 

7—7 

3—3 

1.0008 

1.0017 

7.1 

4—4 

1—1 

11 

95 

4—6 

3—6 

1.0021 

1.0042 

5.8 

6—6 

3—3 

12 

103 

3—9 

7—7 

1.000008 

1.000010 



2—3 


Table (S,lD) Details of minimization prog- 
ress with function ROSIE using circular arcs 
method and starting at ( — 1.2, 1) . 

k = number of valley point 

N = number of function evaluations 


F =: function value : 

for k = 0 ... 5 in decimal notation, 
for k = 6 ... 12 in powers-of-ten notation, 
e.g. ‘‘7 — 2^’ = 0.07. 
t = accuracy achieved by F (k) , 

= tolerance to be satisfied by F (k + 1) . 
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The t-values are calculated from (F(k) — 
F(m))/(F(m) + C), where F(m) is the low- 
est available valley point prior to F(k), and 
where C is a suitably chosen constant (here 
C = 1,23456) to avoid the possibility of the 
divisor tending to zero. These t-values are then 
used as a tolerance when making the next 
gradient correction to find F(k + 1). This 
automatically tightens the tolerance as the 
minimum is approached. 

X, y= coordinates of starting points 
and all valley points. 

R= radius of circle fitted to low- 
est three or four valley 
points. 

grad correction = distance from predicted point 
on circular arc to gradient 
corrected point in valley, 
total correction = distance from previous low- 
est valley point to new valley 
point. 

The first five valley points (k = 1 . . . 5) were 
found by the starting procedure described in 
(2.2). The first circle (with radius R = 4.1) 


is a circle through points 3, 4, and 5; its pre- 
dicted low point was at x =: 6.4 and y = 12.2. 
It was corrected to give the indicated valley 
point (k = 6). The next valley point (k = 7) 
was obtained from a circle fitted to points 3, 4, 

5, 6. Likewise (k = 8) was obtained from 4, 5, 

6, 7 ; and (k = 9) from 5, 6, 7, 8. Valley point 
(k = 10) was obtained from a circle which 
was no longer fitted to any of the five starting 
points, but instead only to the much more accu- 
rate points 6, 7, 8, 9 ; hence, we had a sudden 
speed-up in the convergence from point 9 to 
point 10, as evidenced also by the sudden de- 
crease in the ratio of the grad correction to 
the total correction. Also, the radius R began 
to approximate more accurately the radius of 
the osculating circle (R = 5.59) of the valley 
y =: x2 at the minimum point (1, 1). Also, the 
program had automatically switched at this 
time from the regression plane to the regres- 
sion line when predicting point 10 on the cir- 
cular arc (sec. 2.5). The last valley point 
(k = 12) was the result of applying the tail 
correction (2.6) ; note the sizeable increase in 
accuracy. 


(3.2) SHALOW 

Minimum (F 0) at (1, 1) with valleys along y =: x- and x ~ 1. SHALOW is similar to the 
function ROSIE, but has a shallow valley compared with the steep valley of ROSIE. 

Table (3.2) 


X 

—2.000 

1.184 

0.803 

0.211 

0.820 

X 

y 

—2.000 

0.574 

—0.251 

3.505 

4.690 

y 

k 






k 

0 

4.5+1 

7.2—1 

8.4—1 

1.3+1 

1.6+1 

0 

1 

3.0 0 

2.1—2 

3.3—1 

1.1—1 

9.5—1 

1 

2 

2.2—1 

3.6—6 

1.1—1 

2.7—1 

4.0—1 

2 

3 

5.9—2 

4.3—3 

4.7—2 

4.6—2 

1.0—1 

3 

4 

9.0—2 

1.1—3 

1.3—2 

5.6—2 

2.1—3 

4 

5 

1.6—2 

6.1—4 

3.0—3 

1.6—2 

3.3—2 

5 

6 

4.9—4 

7.7—6 

9.9—5 

2.8—4 

3.4—4 

6 

7 

4.6—4 

8.1—6 

1.1^ 

1.9—4 

4.4—4 

7 

8 

3.3—4 

5.2—8 

9.9—6 

1.7—4 

2.4—5 

8 

9 

4.5—6 

2.7—10 

6.0—8 

2.5—8 

5.8—5 

9 

10 

7.1—6 

4.7—13 

1.1—10 

1.2—7 

6.7—9 

10 

11 

5.4—7 

— 

2.8—16 

8.4—13 

1.1—10 

11 

N 

93 

92 

86 

94 

96 

N 
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Table (3.2) Rate of minimization of function SH ALOW with the circular arcs method for five 
different starting points (x, y) . Bottom row gives the total number N, of function evaluations. 


(3.3) STRAIT 

Minimum (F = 0) at (1, 1) with a steep valley along x = 1. 

Table (3.3 


X 

2.000 

2.000 

2.019 

1.992 

1.986 

X 

y 

—2.000 

—2.322 

—1.505 

—3.222 

5.227 

y 

k 






k 

0 

1.44-2 

1.44-2 

1.34-2 

1.54-2 

9.94-1 

0 

1 

8.4 0 

1 . 04-1 

5.8 0 

1.74-1 

1.74-1 

1 

2 

3.4 0 

4.6 0 

1.8 0 

9.0 0 

1 . 04-1 

2 

3 

1.3 0 

2.1 0 

3.9—1 

5.4 0 

7.3 0 

3 

4 

3.0 — i 

7.3—1 

1.6 — 3 

2.9 0 

4.8 0 

4 

5 

1.7—2 

1.9—1 

1.6—1 

1.7 0 

3.5 0 

5 

6 

5.5—8 

6.2—6 

9.0—11 

1.6—3 

3.7—2 

6 

7 

3.4—5 

4.6—3 

8.9—9 

1.9—1 

3.8—1 

7 

8 

9.6—9 

3.6—6 

— 

2.2—4 

1.2—2 

8 

9 

3.9—5 

1.8—8 

— 

1.9—5 

5.1—5 

9 

10 

0.0 0 

2.7—14 

— 

1.6—12 

2.7—5 

10 

11 

— 

6.4—15 

— 

0.0 0 

5.7—8 

11 

12 

— 

— 

— 

— 

2.5—11 

12 

13 

— 

— 

— 

— 

0.0 0 

13 

N 

78 

83 

58 

83 

95 

N 


Table (3.3) Rate of minimization of function STRAIT with the circular arcs method for five dif- 
ferent starting points (x, y). Bottom row gives the total number, N, of function evaluations. 
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(3.4) CUBE 

Minimum (F — 0) at (1, 1) with a steep valley along y = x®. 

Table (3.4) 


X 

1.200 

1.391 

1.243 

0.284 

—1.200 

X 

y 

k 

—2.000 

—2.606 

—1.974 

—3.082 

—1.000 

y 

k 

0 

1.4-1-3 

2.8- I -3 

1.5- t -3 

9.6+2 

5.8+1 

0 

1 

4.6 0 

1.7- I -2 

8.2+1 

2.1 0 

4.1 0 

1 

2 

4.8—1 

1.8 0 

1.2 0 

5.4 0 

3.8 0 

2 

3 

5.2 0 

2.4—3 

1.2—1 

1.0+1 

3.6 0 

3 

4 

1.8—1 

3.5—1 

7.2—1 

1.7—1 

3.4 0 

4 

5 

1.3—3 

4.8—3 

2.3—2 

3.8—1 

3.3 0 

5 

6 

1.9—2 

1.3—5 

4.5—2 

1.5—1 

1.5+2 

6 

7 

2.4—2 

1.6—5 

3.7—2 

6.0—2 

3.4 0 

7 

8 

6.3—4 

8.5—6 

2.5—2 

7.4—2 

6.7—4 

8 

9 

1.2—6 

2.5—9 

1.7—2 

1.0—1 

4.9—3 

9 

10 

6.7—7 

8.8—11 

2.0—3 

7.9—2 

1.9—3 

10 

11 

7.5—10 

— 

5.5^ 

1.6—2 

6.2—5 

11 

12 

— 

— 

5.9—5 

6.4—3 

2.6—6 

12 

13 

— 

— 

1.8—6 

2.4—3 

1.9—8 

13 

14 

— 

— 

3.6—8 

2.5—4 

4.2—11 

14 

15 

— 

— 

5.6—11 

2.3—5 

— 

15 

16 

— 

— 

— 

8.5—7 

— 

16 

17 

— 

— 

— 

7.2—9 

— 

17 

18 

— 

— 

— 

4.7—12 

— 

18 

N 

99 

98 

113 

135 

162 

N 


Table (S.U) Rate of minimization of function CUBE with the circular arcs method for five dif- 
ferent starting points (x, y). Bottom row gives the total number, N, of function evaluations. 
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(3.5) DOUBLE 

One minimum (F = 0.099899899) at (x = 1.001005; y = 0), another minimum (F = 0) at (x 
= 2 ; y = 0) ; with one steep valley along r = l,and another steep valley along r = 2 ; saddle points 
around ( — 2, 0) and ( — 1, 0). 

Table (3.5) 


X 

—4.000 

—3.709 

—3.935 

—5.392 

—7.633 

X 

y 

0.000 

—0.957 

0.042 

—1.708 

—3.785 

y 

k 






k 

0 

3.6-h3 

2.7-I-3 

3.2+3 

2.9+4 

2.4+5 

0 

1 

1.6 0 

1.5 0 

1.6 0 

1.5 0 

1.4 0 

1 

2 

7.3—1 

6.5—1 

7.3—1 

5.8—1 

1.7 0 

2 

3 

5.1—1 

1.1 0 

7.3—1 

7.2—1 

2.5 0 

3 

4 

1.8—1 

1.5—1 

1.8—1 

2.2—5 

1.0 0 

4 

5 

7.5—1 

4.4—1 

6.2—1 

7.0—2 

2.4 0 

5 

6 

2.4—1 

1.6—1 

2.2—1 

4.4—4 

1.0 0 

6 

7 

2.6—1 

1.5—1 

2.3—1 

5.2-^ 

1.1 0 

7 

8 

1.3—1 

1.2—1 

1.8—1 

2.9—5 

1.4—1 

8 

9 

1.1—1 

1.3—1 

1.1—1 

1.8—6 

1.1—1 

9 

10 

1.0—1 

1.0—1 

1.0—1 

1.5—6 

1.2—8 

10 

11 

1.0—1 

1.0—1 

1.0—1 

1.2—6 

5.7—6 

11 

12 

1.0—1 

1.0—1 

1.0—1 

— 

2.3—6 

12 

13 

1.0—1 

1.0—1 

1.0—1 

— 

1.2—8 

13 

14 

1.0—1 

— 

1.0—1 

— 

—14 

14 

15 

— 

— 

1.0—1 

— 

— 

15 

N 

115 

113 

118 

106 

139 

N 


Table ( 3.5 ) Rate of minimization of function DOUBLE with the circular arcs method for five dif- 
ferent starting points (x, y). Bottom row gives the total number, N, of function evaluations. 
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(3.6) HEART 

Four minima (all with F = 0) at left (0, 0), at right (20, 0), at top (10, -f 12.72), and at bot- 
tom (10, — 12.72). Three valleys: along the x-axis, along x = 10, and along the cardioid r 10 (1 
+ cos 6). 

Table (3.6) 


x 

—1.000 

—9.870 

—11.43 

—6.255 

—7.539 

X 

y 

0.000 

'—2.159 

—7.344 

—21.13 

—27.04 

y 

k 

0 

1.0+3 

7.3+5 

1.1+7 

3.1 +7 

6.9 +7 

0 

1 

9.5+1 

8.8+2 

4.1+2 

5.3 +1 

8.6 +2 

1 

2 

9.9+2 

5.2+2 

7.6+2 

1.0 +3 

1.4 +3 

2 

3 

4.8+2 

3.1+2 

4.2+2 

1.9 +1 

6.2 +1 

3 

4 

1.6+1 

1.9+2 

1.2+2 

3.9 +2 

7.4 +2 

4 

5 

8.3—2 

1.1+2 

5.8—2 

2.5 +2 

3.6 +0 

5 

6 

1.8—2 

1.4+1 

3.4 0 

3.0 +1 

3.6 +0 

6 

7 

1.4—4 

2.2 0 

1.8—3 

4.5 +0 

1.8 —1 

7 

8 

1.3—3 

2.3+1 

2.2—3 

2.4 —8 

4.3 —6 

8 

9 

9.1—4 

8.7 0 

3.7—5 

1.2 —2 

2.7 —3 

9 

10 

3.1^ 

6.2 0 

3.5—9 

3.3 —10 

4.1 —3 

10 

11 

1.7—4 

2.1 0 

6.7—7 

6.9 —4 

4.1 —4 

11 

12 

7.7—5 

5.6—3 

8.2—12 

5.7 —11 

3.3 —5 

12 

13 

4.8—4 

8.3—6 

(bottom) 

(right) 

6.8 —6 

13 

14 

9.7—5 

1.7—7 

— 

— 

2.0 —6 

14 

15 

7.3—5 

1.7—7 

— 

— 

(bottom) 

15 

16 

7.1—5 

(top) 

(left) 

— 

— 

— 

16 

N 

149 

127 

119 

121 

150 

N 


Table (S.6) Rate of minimization of function HEART with the circular arcs method for five differ- 
ent starting points (x, y) . Bottom row gives the total number, N, of function evaluations. 


CONCLUSION 

We feel that the foregoing results conclusively 
show the gain in efficiency obtainable for a 
direct minimum search procedure when an 
attempt is made to approximate by suitable 
curves the valleys of the function to be mini- 
mized. The circular arcs method which strives 
to do so can only be a beginning. Various steps 
and details of its procedure can and will doubt- 
lessly be improved in the future ; most interest- 
ing should be the results of its generalization to 
n variables. It is possible that the gain in effi- 
ciency is larger when the circular arcs method 
is applied to functions with steep-sided valleys 
than when it is applied to functions with more 


clearly defined minimum points and less clearly 
defined valleys. On the other hand, when pen- 
alty terms are added to an objective function, 
in order to also observe inequality constraints, 
functions with steep-sided valleys will auto- 
matically be generated; thus, the methed may 
become useful in the field of non-linear pro- 
gramming. The authors wish to acknowledge 
the constant support and encouragement they 
received from Dr. Wm. J. Schart, Chief of 
Mathematical Analysis at General Dynamics/ 
Astronautics, and Mr. Carl E. Diesen, Manager 
of Scientific Programming and Analysis at 
General Dynamics/Astronautics. We also wish 
to thank Mrs. Marilee Culver for the efficient 
and neat typing of the manuscript. 
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ON THE EVALUATION OF THE COST-EFFECTIVENESS 
OF COMMAND AND CONTROL SYSTEMS 

N. P. Edwards 

Weapons Systems Evaluation Group 
Washington, D. C. 


INTRODUCTION 

1. some time now I have been previleged 
to be associated with a project which has been 
taking a critical, and hopefully, constructive 
look at the workings of the National Military 
Command system. Working in the Pentagon one 
regularly hears the expression “Cost-effective- 
ness.” Sooner or later someone had to combine 
command and control with cost-eflfectiveness. 
This paper then addresses the cost-effectiveness 
of military command and control systems. 

2. Command and control is a broad and very 
active field as can be seen from the many 
articles in military and trade publications and 
in the popular press. For instance, the Armed 
Forces Management magazine devoted its July 
1963 issue to command and control. I would like 
to review some of the titles of the various 
articles : 

— “NMCS: The Command Backup to 
Counterforce” 

— “Total Command Control Through Com- 
puters?” 

— “Taylor’s Flexible Response Strategy 
Shifts C&C Perspective” 

— “Electronic C&C Will Aid Army Com- 
mander, Not Replace Him” 

— “We Have More Than 800 C&C Systems” 

— “Resource Management: A New Slant on 
C&C” 

—“Real Time C&C Aids USAF, NASA” 


One of these also states “. . . C&C system is a 
communications system . . .”* Under the aegis 
of C&C, Electronic News reports that Mitre has 
developed “. . . a new approach, whereby com- 
puters will be provided with a broad, flexible 
program which contains nothing regarding the 
using commanders’ specific problems. The pro- 
gram — which will be a ‘learning program’ based 
on artificial intelligence — ^will enable the com- 
puter to leam by experience as the system com- 
mander feeds his specific problems into it.”^ 

3. From this small sample of the current 
literature, it can be seen that the expression 
“military command and control systems” has a 
wide range of meaning to different users, and 
covers a broad scope of systems and activities. 
If a meaningful attempt is to be made to discuss 
cost-effectiveness of command and control sys- 
tems, it is essential to find out what we mean by 
command and control. So let us begin by de- 
fining the individual terms and attempting to 
break the problem of cost-effectiveness into 
manageable segments for examination. 

4. “Command and control” as defined in the 
Dictionary of the United States Military Terms 
for Joint Usage {JCS Pub. f ) is “An arrange- 
ment of personnel, facilities and the means for 
information acquisition, processing and dis- 
semination employed by a commander in plan- 
ning, directing and controlling operations.” 
Note that the word “system” is not used. 

5. A “command and control system” as the 
term is normally used, includes many different 
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functions. In order to beg'in to separate these 
so that they can be handled individually, let us 
differentiate between the command function 
and the control function. We will first define 
and describe the characteristics of control sys- 
tems : These are a class of systems sometimes 
included in the general definition of command 
and control, but which we propose to exclude 
from the subsequent discussion, except for use 
as a basis for analyzing the command subsys- 
tems we are really interested in. 

CONTROL SYSTEMS 

6. As normally used in the technical sense, 
a control system regulates the action or activity 
of a person or device in order to perform a 
prescribed function. The key point here is that 
the mission to be accomplished is understood 
exactly and the means of accomplishing this end 
are also well understood. A control mechanism 
operates in accordance with precisely defined 
laws. It senses the performance of the con- 
trolled device and corrects deviations from the 
desired course in accordance with mathematical 
formulae. Control systems frequently have 
people in the control loop. An aircraft controller 
is an example of a human link in a control 
system. While the controller does not operate 
in a completely predictable mode, the majority 
of his actions are predictable in any except an 
emergency situation. In a weapons control sys- 
tem, of which SAGE is probably the largest 
single example, the majority of the functions 
can be described mathematically. Those func- 
tions which cannot be described mathematically 
are referred to human judgment. However, 
the ultimate goal is to eliminate the require- 
ment for human judgment from the control 
system once the command to execute has been 
issued. Some systems, such as missile weapons 
systems and antimissile systems, operate at 
such a high speed that it is necessary to elimi- 
nate the human operators from the control 
system loop, A distinguishing characteristic of 
control systems (as defined in common technical 
usage) is “feedback.'^ This is true regardless of 
whether the system controls things or people. 
“Feedback’’ is the process by which a control 
system constantly senses what is occurring and 
adjusts the system to achieve the desired result. 
An aircraft controller directing the landing ap- 


proach of an airplane uses feedback. He con- 
tinually follows the actions of the plane and 
gives correcting instructions in case the plane 
is not following the desired course. 

7. The diagram in Figure 1 is an oversimplifi- 
cation but it is intended to illustrate the de- 
pendence on sensors and feedback, and the 
prescribed nature of the action. 



CONTROL SUBSYSTEM 

Figure 1. 


COST AND EFFECTIVENESS OF 
CONTROL SYSTEMS 

8. Various control systems which are com- 
pletely (or for the most part) mechanized, have 
been in production for many years. Such sys- 
tems include aircraft autopilots, instrument 
landing systems, missile guidance systems, tank 
and naval gun stabilizing and aiming systems, 
and many industrial process control systems. 
In the case of an industrial control system, such 
as a system controlling the operation of an oil 
refinery, cost and effectiveness trade-offs have 
been very carefully examined and are well 
understood. This is possible because the proc- 
esses, procedures and desired end products are 
well defined. This is also the case in many small 
military control systems. Although the value of 
increments of improvement in the performance 
of a military control system may be difficult or 
impossible to measure, the value of an improve- 
ment in the performance of a public utility 
generating plant can be measured with great 
precision. An increase in operating efficiency 
of 0.1% may represent a significant increase in 
the profit margin. 
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9. There are three factors which predominate 
in their effect upon the accuracy of a cost- 
effectiveness estimate for a new control system. 
These factors are: value of performance, how 
well the function to be performed has been 
defined and the performance level desired. 

a. Value of Performance. Can a value be 
assigned to an incremental improvement in 
the performance of the system? In many 
instances it is possible to identify a signifi- 
cant improvement in performance and assign 
a value to it. Two performance factors of 
interest in the control subsystem of a weapon 
system are: 

(1) Reliability — If the reliability of the 
weapons control system can be im- 
proved, this improvement can be 
reflected in the overall reliability of 
the weapon. If the control is the least 
reliable subsystem, then the improve- 
ment will probably be quite significant. 
If the control subsystem is already the 
most reliable part of the system, then 
the improved reliability may have 
little effect on the weapon’s perform- 
ance. However, it might still be well 
worthwhile because of a reduced field 
maintenance problem. The total sys- 
tem cost and increased efficiency must 
be taken into account. 

(2) Accuracy — ^An improvement in the ac- 
curacy of the control system may 
significantly reduce the number of 
weapons required if the target loca- 
tion is known to a higher accuracy 
than the accuracy of the weapon. Sup- 
pose the target location is known 
'‘exactly.” If the accuracy of delivery 
of the weapon is such that the target 
will always be within the radius of 
destruction, only one weapon (shell, 
iron bomb, or other) is required. If 
the error in aim and delivery is greater 
than the kill radius, we have a more 
expensive and probably less efficient 
situation (see Figure 2). The number 
of weapons required goes up loga- 
rithmically as the probable dispersion 
gets significantly larger than the 
radius of destruction of the weapon. 



IMPACT 
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Formulae and nomographs exist for 
calculating the weapons required for a 
desired probability of destruction of 
the target.^ 

If you cannot assign a meaningful value to 
incremental improvements in the system, a cost- 
effectiveness projection cannot be made. How- 
ever, analysis can still be used, and it still may 
be possible to make a relatively accurate com- 
parison of the projected cost of different ap- 
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make a proper (or best possible) decision using 
the facts made available by eyeful analysis. 

b. Definition of Function. Are the functions 
to be performed by the control system defined 
and will they remain constant during the de- 
sign, test, and production of the system? If so, 
one of the necessary conditions for an accurate 
cost projection has been met. If not, it is certain 
that the initial cost estimates will be signifi- 
cantly in error. The more poorly defined are the 
functions to be finally performed, the greater 
the probable error in the cost estimate. It would 
appear that it should be relatively easy to deter- 
mine the functions required of a control system. 
However, if the device to be controlled is being 
developed concurrently, and by a different engi- 
neering group or company, as is frequently the 
case, it is probable that control requirements 
will occur which were not anticipated. It is 
impossible to describe this risk numerically in 
the abstract, as each case will be different. 

c. Performance Levels. Is the system’s per- 
formance specified? Will this performance re- 
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main essentially constant during the design, 
test, and production of the systems ? If so, an- 
other necessary condition for a good cost esti- 
mate has been met. If not, the estimate has a 
high probability of being wrong. Is the per- 
formance required comfortably within the pres- 
ent state of art? If so, a major condition 
necessary to permit an accurate cost estimate 
has been met. If not, R&D is required and the 
estimate may be in error by factors of tens 
or hundreds. 

10. Even where a new hardware/software 
system is of a kind which has been designed 
and produced repeatedly, accurate cost-effec- 
tiveness measurements or predictions cannot be 
made unless the value of what is to be accom- 
plished is known, how it will be accomplished, 
and whether it can be done without R&D. This 
discussion has been included to illustrate the 
minimum conditions which must be met before 
reasonable cost-effectiveness projections can be 
made of any system or equipment. 

COST-EFFECTIVENESS OF ^‘COMMAND 
AND CONTROL’^ SYSTEMS 

11. For this discussion, three parts of the 
“command and control’* function will be con- 
sidered. These are more accurately called “Com- 
mand and Support Systems.” Figure 3 is a very 
simplified version of a very complex and per- 
sonal activity. It is complex because it is an 
activity of sophisticated people in a complicated 
and changing environment, reacting to ambigu- 
ous situations which are inaccurately and in- 
completely reported. It is personal because each 


COMMANDER 



COMMAND SUBSYSTEM 

Figure 3. 


commander has his own particular likes, dis- 
likes, prejudices and means of operating. The 
commander will augment any formal “subsys- 
tem” we may build in any way he sees fit. The 
augmentation may be temporary or permanent 
(for the tenure of the commander) but it is not 
part of the subsystem discussed here. Realizing 
that we are discussing command support sub- 
systems, let us consider each of the three 
selected which are: 

а. Data-Gathering Subsystem. A subsys- 
tem which gathers information necessary to 
support a command decision. 

б. Analysis Subsystem. A subsystem which 
provides or supports the analysis of the infor- 
mation. (The command function is not con- 
sidered a subsystem.) 

c. Transmission Subsystem. A subsystem 
for the promulgation of the decision to the 
appropriate subordinates. 

12. The function of command, i.e., evaluation 
of the information and the command decision, 
is not addressed. This simplified diagram would 
emphasize the facts that : 

a. Commanders use many inputs, including 
feedback. 

b. The commander knows and relies on the 
intelligence and experience of the person re- 
sponsible for executing the order. 

13. Webster’s New World Dictionary uses 
the phrase “Command . . . when it refers to 
giving orders, implies the formal exercise of 
absolute authority . . . order sometimes suggest- 
ing an arbitrary exercise of authority ...” This 
is a clear recognition of the fact that it is not 
possible to describe rigorously the transfer 
function involved in the command process. This 
is true not only in general ; it is difficult to think 
of a single specific command situation in which 
a series of logical conditions can be defined 
which describe the specific performance re- 
quired of the commander. However, it is pos- 
sible to describe a few selected situations in 
which the correct performance can be readily 
predicted and completely defined. A doctrinal 
response is such a case. However, it is easier 
to describe one of the infinite number of 
situations in which the correct performance 
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cannot be completely defined in advance. Also, 
it will probably not be possible to determine if 
the decision was correct for days, or even 
months or years. 

14. Since the command process itself cannot 
be described precisely, or for that matter with 
any degree of precision whatever, it is mean- 
ingless to talk about cost-eflfectiveness predic- 
tions for the command function. Specific 
arbitrary situations can be defined (as is done 
in a war game), decisions postulated and the 
predicted results evaluated. This is useful to 
contribute to the learning process of com- 
manders. It may also be useful to assist in 
evaluating a commander’s thinking in the 
absence of a real war (certainly one would not 
want to leave the fate of a real military decision 
exclusively in the hands of a computer war 
game) . While industry has for some years used 
a business game as a means of training and 
evaluating executive management, they do not 
leave real business decisions in the hands of 
this business game. 

DATA-GATHERING SUBSYSTEM 

15. Any commander (or business executive) 
needs information on which to base his decision. 
Military data-gathering systems have existed 
since before man first picked up a club. General 
Custer had a data-gathering system which pro- 
vided him with adequate information to have 
prevented the massacre. His scouts told him 
that the Sioux were as numerous as blades of 
grass on the prairie. It was in the data evalu- 
ation that Custer’s command system was at 
fault. 

16. Since some data-gathering systems are 
well understood, it should be possible to estab- 
lish a cost-effectiveness comparison between 
various systems designed to achieve a specific 
performance. However, remember that this 
comparison is subject to all the conditions which 
limit the accuracy of cost-effectiveness esti- 
mates described above for control systems. It 
should also be possible to predict (with varying 
accuracy) the cost and performance of a given 
data-gathering system. However, it is certainly 
not always possible to accurately assess the 
value of increments of performance of a data- 
gathering system. No generalized observations 


can be made concerning timeliness, accuracy or 
reliability of the system. These are functions 
of the specific situation. Consider these major 
performance factors: 

a. Timeliness. How much is it worth to 
have the data a day, hour, five minutes, or 
ten seconds sooner? This is completely de- 
pendent upon the nature of the system, the 
nature of the situation, and the nature of the 
data. Given a specific data requirement, it is 
probably possible for an experienced military 
commander to put an arbitrary (approxi- 
mate) value on the timeliness of the data. It 
is the commander’s responsibility to make 
this decision since he is the one person who is 
held responsible for the consequences of his 
decisions. 

b. Accuracy. How much is accuracy worth 
in a data-collection system? This again is 
dependent upon all the factors listed above 
and also on the accuracy of the raw data and 
the quantity of the data. Given a specific re- 
quirement for data, arbitrary and approxi- 
mate values can be assigned by the com- 
mander. It is not possible to do this in the 
abstract. (The accuracy of the system could 
be defined as the percentage of the data en- 
tered into the system w'hich arrives un- 
changed at the output of the data-collection 
system.) 

c. Reliability. This could be defined as the 
percentage of the time that the system is 
performing in its normal manner. What 
effect will a temporary or permanent failure 
have? Here again, the final answers are de- 
pendent on the specific system and can be 
assessed by experienced judgment only. 

17. To reiterate, it is possible to compare the 
relative cost and performance of various data- 
collection systems designed to do a specific job. 
If the performance is equal, or assumed to be 
equal, the relative cost can be compared. The 
problem is, proposed systems are likely to be 
only approximately equal and then only where 
a system has been designed in response to a 
rigid set of specifications such as may be the 
case when contractors respond to a request for 
proposal. Even here a data-collection system is 
usually rather complex and the variations of 
system performance implicit in the proposals 
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are very likely to be of significance to the mili- 
tary commander. 

18. Certain types of command situations per- 
mit a relatively accurate and profitable assess- 
ment of the value of timeliness, accuracy and 
reliability. Consider the case of a moving tar- 
get with a known top speed. Knowledge of its 
exact present location is limited by the speed, 
accuracy and reliability of the reporting system. 
If we don't know of any restraints on its direc- 
tion of travel, we must assume the target has a 
certain probability of being within a circle 
whose radius is determined by its speed and the 
age and quality of our knowledge of its last 
position. If we assume, for simplicity, that we 
have an accurate, reliable delivery system and 
a certain radius of kill, we can calculate the 
number of weapons which must be applied to 
the target area to give a desired probability of 
destroying the target. A method giving mini- 
mum overlap of destructive areas is the in- 
scribed hexagon model.^ The target area is 
covered by hexagons inscribed into radius of 
damage circles. If the radius of the target and 
the radius of desired damage are equal, one 
weapon is required. If the radius of the target 
is 5 times the radius of destruction, 37 weapons 
are required. (See Figure 4.) The number of 
weapons goes up as a function greater than, but 
asymptotic to, the square of the linear uncer- 
tainty as to the location of the target. This 
uncertainty includes, when you are estimating 
the number of weapons to stock : 

a. Reporting accuracy. 

b. (Speed of target) x (Probable reporting 

time loss). 



MOTION 


Figure 4. 


c. A safety factor to allow for the fact that 

the information you have may be older than 

you think {reliability of the reporting sys- 
tem). 

This analysis still will not give you the total 
cost of being effective, because it does not 
include such things as the probability of being 
able to deliver more than one weapon, time loss 
if weapons are delivered sequentially, or the 
reaction of the target if you miss the first time 
and get it excited, etc., etc. This simple example 
is virtually a control problem, but illustrates the 
point. 

19. Assignment of absolute values to the total 
aspects of the performance of these systems 
requires the highest knowledge of the situation, 
the military strategy, policy and innumerable 
other factors relating to the national welfare. 
Hence, it is necessary that judgment be exer- 
cised in making the decision as to which system 
is selected. This judgment can never be made 
with full possession of all pertinent facts. It 
will always be subject to human error and can 
always be challenged. The military commander 
is held responsible for the consequences of this 
decision and, therefore, must have final respon- 
siblity for making the decision. This does not 
belittle the contributions that can be made by 
proper application of analytical techniques. 
Careful analyses and cost projections of com- 
petitive systems can highlight areas of uncer- 
tainty as well as provide evaluations of known 
facts, and should greatly enhance the prob- 
ability of the commander making the correct 
decision. 

ANALYSIS 

20. While the function of analysis has oc- 
curred in the commander's mind, with the aid 
of his staflf, since the beginning of military 
action, it is possible to describe precisely only 
very limited aspects of the total field of infor- 
mation analysis for military command. In spite 
of this, much of the analysis requires the rapid 
and orderly retrieval of data from a large body 
of reference and current operational informa- 
tion. Facts relating to force status, logistics of 
supply and movement, geography, climate and 
installations are required, to name some of the 
more important. These must be stored in an 
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orderly fashion. However, to support analysis, 
the facts must be related and sorted into many 
different orders or categories. Various manual 
or mechanized systems to aid this activity can 
be compared for cost, convenience, speed and 
other factors. Statements can be made regard- 
ing cost and relative performance. In some 
cases it may be possible to put a specific value 
on a difference in performance, but usually it 
requires command experience and judgment to 
make the decision. And the decision as to the 
worth of a particular fact, array of facts in a 
category, or the worth of specific analysis may 
change drastically when the commander 
changes. Here, perhaps more than in any other 
command support subsystem, the effectiveness 
can be assessed only in the mind of the com- 
mander. What does he think he needs? 

SYSTEM FOR THE PROMULGATION 
OF ORDERS 

21. Here again we have a function which has 
been performed by the military since the begin- 
ning of organized warfare. Given a specific 
situation, or a general requirement, it is pos- 
sible to describe many different approaches to 
disseminating command information. These 
have included word of mouth, runner, carrier 
pigeons, bugles, drums, flags and rockets. 
Again, it is possible to compare the relative cost 
of various systems only if they have equal per- 
formance. Again, it is a matter of military 
judgment as to the value of improved perform- 
ance in such a system. Accuracy is extremely 
important. Reliability, i.e., probability that the 
command will be delivered, is of great value. 
The value of speed may be dependent in part 
upon the response time of the force commanded. 
A command transmission which takes seconds 
may be of no greater value than one which 
takes minutes where the response time of a 
military unit is a week. However, minutes or 
hours may be important even though the re- 
sponse time of a unit is days or weeks. It is 
possible to miss even a slow boat. To establish 
values of various system performance factors 
usually requires military judgment and cannot 
be generalized. Each planned application can 
be studied and in many cases definite, but not 
definitive, values can be attached to improve- 
ments in performance of some part of the sys- 


tem. Other factors which cannot be assigned 
specific values, such as giving the order a 
greater sense of urgency or authenticity, may 
be more important. 

22. It is worthwhile to single out the specific 
case of a nuclear exchange. This is certainly 
the most thoroughly studied of all the possible 
military activities which are not yet a matter 
of history. The nuclear exchange, once it is 
initiated, is usually considered to he more 
nearly a control problem than a straight com- 
mand problem. It is bounded by doctrine, a 
finite number of weapons and target choices. 
Command support subsystems with specific 
tasks can be evaluated for specific cost and rela- 
tive effectiveness. A missile detection system 
can be evaluated — with existing missiles — and 
its performance against hypothetical new mis- 
siles estimated. An analyst can even assess 
many of the advantages of having a few min- 
utes more warning time. But an evaluation of 
the dollar worth of this time is a task for the 
top military policy makers. Values can be as- 
signed to degrees of reliability and accuracy. 
These can contribute to a sound decision, as 
they would in any other weapon of destruction 
system. In general these are like or are really 
control subsystem problems and can be ad- 
dressed by an analyst with considerable hope of 
obtaining some meaningful results. 

23. The* command problem in the trans- 
attack and postattack period is more complex 
and probably less understood than conventional 
warfare. We don’t have any relevant experi- 
ence since fortunately we have never had an 
exchange. There are several questions we can 
ask about any proposed subsystem for com- 
mand support : 

a. Can the system survive to do its job? 

b. Will there be anyone to use it? 

c. Will there be any data to feed it? 

d. Will it help? 

e. What will it cost? 

24. Prior to this point an assumption has 
been made that hasn’t been stated. It has been 
assumed that the subsystem or system compo- 
nent does something useful — it contributes to 
the better solution of the problem. This is the 
first question: Is this system relevant? The 
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answer to this may save a lot of analysis time 
and many procurement dollars. If the answer 
is “Yes, the subsystem is relevant and useful,” 
the following general observations can be made : 

a. Two or more approaches to the solution 
of the problem can be compared as to cost 
and performance factors. 

b. With luck, one of the alternatives may 
be both better and cheaper. 

c. If additional performance is offered by 
the more expensive alternative, the problem 
arises of assessing the value of the increment 
of performance. Even if dollar values can 
be assigned, there are always other less tan- 
gible operational factors which must enter 
into the decision. 

d. A choice may exist as to whether or not 
to have a control system (or a particular 
weapon or device), but no choice is offered 
in the case of a command system. The com- 
mand systems exist now. So we are always 
offered a relative choice. We can use present 
methods or add support subsystems. These 
can always be compared with the existing 
way of doing business. In either event we 
have the basic command structure. Command 
systems are not a question of hardware. 

e. In all system component choices we may 
be faced with the cost estimator’s standard 
dilemma. System A may use a part of an 
existing system, while System B may have 
growth potential or reduce the cost of future 
systems. 

/. The analyses and quantifications which 
can be done using the techniques provided by 
the engineering and scientific communities 
are valuable tools which can assist the com- 
mander in making the best of an extremely 
difficult situation. 

g. The final decision as to the cost-effec- 
tiveness of a military command and control 
system should rest with the commander who 
is responsible for using the system and for 
the effect that it has on the success of his 
mission and the national security, whether 
he is a component commander or the Com- 


mander-In-Chief of the U.S. Armed Forces. 
He should have all available help, utilizing 
the best analytic techniques, in order to make 
the best decision. 

25. The commanders of the Unified and Spec- 
ified Commands have been, in fact, given the 
responsibility for their command and control 
systems by the Secretary of Defense on 26 
October 1963, according to Armed Forces Man- 
agement, January, 1964. The Commanders will 
be looking eagerly for help in solving their ex- 
tremely knotty and complex command support 
problems. I believe that virtually everyone here 
considers himself a professional, and many of 
you will be called upon to help solve these prob- 
lems. Let us take a responsible attitude, and 
recommend hardware and software only when 
it is relevant to the problem, and can contribute 
to the solution better than an improved manual 
system, and only then after we are certain that 
the improvement is worth the cost; and that 
the user wants the kind of solution we propose. 
And I mean the user, the man who will have 
to make the system work as a part of his daily 
job. Let us have the courage to say, when nec- 
essary, “Glamorous hardware and elegant soft- 
ware alone won’t solve your problem.” And 
then work with the people who have the prob- 
lem, and together find out what the problem is. 
And perhaps find the solution, which may even 
make use of good hardware and software. Our 
professional ethics and pride require it. 
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INTRODUCTION 

Every subject is said to have both an inside 
and an outside. Its outside has to do with the 
relationships between the subject and matters 
external to it ; while its inside is concerned with 
the subject’s own internal dynamics. My talk 
concerns the dynamics of information in today’s 
institutions with specific reference to the prob- 
lems of command and control. Thus I shall leave 
Questions concerned w^ith the relation of com- 
mand to strategy and the nature of future wars 
to rny colleagues who are well prepared, as I 
am not, to discuss them. 

Throughout my remarks I shall make a great 
deal of use of the word ‘"context.” And so I shall 
first clarify just what I mean to convey by that 
word. I shall mean by context the total view a 
man, or a group of men, has of his situation at 
a given time. Thus context includes knowledge 
of environment, historic perspective, plans for 
action, motives, goals, all aspects of our under- 
standing of the what, how and whys of our 
current situation. The context of a military 
headquarters is rather well-delineated. It in- 
cludes the personnel records of the command, 
intelligence, operational information such as 
status of forces, logistic re 2ords and plans, oper- 
ational plans, communication procedures, and 
certainly a clear understanding of the com- 
mand’s mission. 

In its broader meaning, command and control 
are the means by which an organization main- 
tains adequate and interconsistent contexts for 
its constituents. The inside problems of com- 


mand and control are the problems of maintain- 
ing adequate, relevant contexts under changing 
conditions and problems, and of insuring con- 
sistency across the varied and changing con- 
texts of the members of the organizational hier- 
archy. It is on these problems of contextual 
dynamics that I would like to^focus your atten- 
tion. 

THE SELF-ACCELERATION 
OF CONTEXT 

Our contexts tend to grow. The more we 
know, the greater our curiosity, the more the 
remaining unknowns provoke us. Each new 
discovery seems to create more problems than 
it solves, problems which beckon us to more 
complex, more extensive theories and under- 
standing. This quest for knowledge is not only 
for facts concerning the external world, but also 
for a better understanding of ourselves, of our 
wants and objectives, and of the relative values 
to us of the alternatives we perceive. Context 
is ever-expanding. 

If our contexts were limited to those which 
could be contained in the head of a single indi- 
vidual, our knowledge would be limited indeed. 
We have learned to build communities, com- 
munities whose principal function is to build 
and maintain a group context which exceeds in 
information that of any of its constituents by 
many orders of magnitude. Specialization can 
be viewed as a fractionization of the group con- 
text in such a way as to maximize the total in- 
formation in a community that can be brought 
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to bear on its various concerns. And the es- 
sence of organization is to insure intraconsist- 
ency across the group context while maximiz- 
ing Its informativeness. 

Our motivation for this insatiable quest for 
ever greater knowledge is clear. The more we 
know, the more efficiently we can modify our 
environment to meet our desires. Certainly our 
current standard of living is ample evidence of 
what the expanding human context can mean 
to us. But if expanding context brings greater 
control over our environment, it also extends 
our ability to gain knowledge itself. The great 
extensions of context made by Copernicus and 
Galileo and the observations of Tycho Brahe, 
opened the door for the kinematics of Kepler. 
In turn these advances showed the way for the 
dynamics of Newton, and so on through the 
discovery of the Van Allen belt and solar winds. 
This same effect is equally applicable in the 
small as in the large. When a boy learns to ride 
a bike, this new knowledge vastly increases the 
world he can explore. Contextual growth is 
self -accelerating. 

Look at this fact very carefully : Context is 
self-accelerating. If context were limited to a 
single individual, there would be a ceiling on 
contextual growth and this acceleration would 
be controlled. But by building larger and larger 
communities, we have removed this bound on 
contextual growth. If the community context 
were to grow steadily, one could conceive of 
an ever-enlarging culture. But the self-acceler- 
ation does not permit steady growth. As the 
rate, of growth of knowledge increases, as well 
as its overall extent, self-consistency of the total 
construct becomes strained. Communication 
across the context cannot keep up with the new 
discoveries and conceptual extensions taking 
place on all sides. The feeling of being out of 
touch rapidly deteriorates to a fractionization 
of community identification. The basis of com- 
munication, the commonality of context, disin- 
tegrates. Castastrophic fractionization of con- 
text ensues, and the result is the plummeting of 
the amount of information remaining in the 
community. 

The symptoms of contextual fractionization 
are many. Among them are, for example, a 
feverish preoccupation with the problems of 


military command and control, attempts to 
centralize control of departmental or corporate 
informational activities, acceleration of organ- 
izational change, greatly exaggerated evalua- 
tion of communication activities and increase in 
‘"paperwork"’ of all kinds, and standardization 
by fiat, rather than by community recognition 
of common practice. And the final desperate 
stage to secure the shattering informational 
community is dictatorship. 

Is such catastrophic fractionization of our 
communities inevitable? I do not know. Cer- 
tainly the reading of history gives us pause to 
wonder. But there are ways of dealing with the 
problem of contextual explosion. Context must 
fractionate, because the strains on communica- 
tion do become excessive, strains which build up 
faster than technological advance can relieve 
them. But we can achieve orderly fractioniza- 
tion. The constituent communities can be or- 
ganized to retain through the organizational 
hierarchy the means to bring to bear a total 
body of know-how and understanding not 
achievable by any one element of the organiza- 
tion. 

Gentlemen, the Department of Defense is 
tending toward catastrophic fractionization. 
The almost explosive rate of contextual expan- 
sion is all too clear. Intercontinental missiles 
are the result of global concerns and scientific 
advance, not vice versa. And the rate of expan- 
sion of the total defense context, indeed, of 
every facet of that context, has become explo- 
sive. There is no reason here to recite examples 
of the disintegration of communications, of in- 
consistencies of policy, or of the increasing dif- 
ficulties of initiating definitive actions. We are 
all too aware, each in our limited sphere, of this 
fractionization. And even the newspapers and 
trade journals confirm our fear that it may be 
widespread. 

Those of us gathered here represent a sizable 
part of the community that has both means and 
mission to reverse this rapid decay in overall 
information in our monolithic organizations. 
We can assist in the achievement of ordered 
fractionization that can in turn increase the 
potential for information of today’s institutions, 
and thus our potential for decisive, efficient 
action. But it is not in the increase in commu- 
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nication, the centralized data centers or the 
paternal agencies established in the name of 
efficiency, that this reversal will be achieved. 
Efficiency in the informational activities of a 
community come from entirely different sources. 
Before discussing specific approaches that our 
industry might undertake to give effective serv- 
ice at this critical time, I would like to take an 
overview of the kinds of measures a community 
can take to accommodate its expanding con- 
text — measures I believe the Defense Depart- 
ment must take if it is to maintain an adequate 
defense posture for our nation. 

STEPS TOWARD PREVENTING 
CATASTROPHIC FRACTIONIZATION 

Organize to Minimize Communication 
Requirements 

The task is not that of maintaining a single, 
all inclusive context where the centralized Sec- 
retary's Office has all the details of its far-flung 
operations at its fingertips. No single context 
can accommodate it. It is not just that no one 
man can take it all in a single thought. It is 
simply impossible to put such a context to- 
gether no matter what resources are brought 
to bear. The means of communication are not 
adequate, nor can they be made adequate, to 
maintain the consistency of such a context 
whose rate of change in every facet is as high 
as it is today. 

The essence of traditional command doctrine 
is the delegation of command prerogatives to 
the responsible officer on the scene. Let us ex- 
amine this doctrine for a moment. What do we 
mean ‘‘on the scene?" In a task force operation 
the man on the scene is the task force com- 
mander. He is directly in touch with his sub- 
ordinate command elements which he controls 
and maneuvers in the accomplishment of the 
task force objective. However, if we consider 
one of these subordinate units, say a combat 
ship or combat wing, the commander of that 
unit is on the scene relative to its control and 
the coordination of its operation. In turn, for 
the operation of the ship's power plant, the en- 
gine room officer is on the scene ; for the opera- 
tion of a particular aircraft, the aircraft com- 
mander is on the scene. 


A military operation — any operation — di- 
rectly deals with reality ; not with hypothetical 
plans, nor with far-flung systems which can 
never be fully sensed or directly experienced, 
but with individual men and individual items of 
equipment, and individual reports and individ- 
ual points of view. This reality is strangely 
peculiar to the particular operation. It may be 
an operation at the highest level, such as dealing 
with Rusk and McNamara, Khrushchev and De 
Gaulle. It may be an operation at a middle level, 
such as the execution of a blockade of Cuba 
dealing with particular ships, the personalities 
of well-known officers and with weather condi- 
tions you have personally tasted on many a 
watch. Or it may be an operation at the lowest 
level such as knowing instinctively the coverage 
pattern for a radar and the kind of returns to 
expect, and the sense to be able to distinguish 
the moon from incoming missiles. Traditional 
military command doctrine gives the responsi- 
ble officer, who is in the position to directly 
sense the reality, the means and the preroga- 
tives to deal with that reality “on the scene." 

The essence of this doctrine is twofold. First, 
the responsible officer on the scene is in the best 
position to have the relevant information. Sec- 
ond, the delegation of prerogatives assumes 
responsible reporting of the relevant facts 
upward. As a result, the requirements for com- 
munication are reduced to a minimum and the 
relevance of communication is maximized. 

To attempt to obtain at a far-away headquar- 
ters the essential details, the peculiarities of an 
operation, and to exercise detail control from a 
distance puts impossible strains on communica- 
tions. I do not mean by “communication" solely 
the transmission and reception of signals, but 
all that is implied when we say two men are 
communicating, one to the other. Those who 
think that this problem can be solved by great 
automatic data systems — and there are many 
people who think this in our industry — have no 
appreciation for the complexities of military 
operations, though they would decry the diffi- 
culties of debugging a simple computer pro- 
gram by telephone. The result of such systems 
is inevitably the inundation of the higher head- 
quarters with noise and the rapid deterioration 
of the effectiveness of the subordinate. 
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The notion is also popular that one can cen- 
tralize a portion of an operational task in a 
central agency, far removed from the realities 
that give meaning to the subtask in the first 
place. Thus, for example, major functions of 
military intelligence and military communica- 
tion are centralized for the sake of efficiency. 
But the sources of feedback and the measures 
of relevance germane to the operational realities 
and requirements are missing in such central- 
ized agencies. The relevant measures, heard 
only indistinctly, are all too easily replaced by 
feedback from the local environment which are 
all too often at cross purposes with the needs of 
the field. 

One of 'the control centers of a unified com- 
mand wished to move a telephone fi;om one side 
of an office to the other. It took two years for 
the paperwork to be processed through Wash- 
ington before official permission was given. Of 
course, the phone was moved long before that. 
But it is precisely the inconsistency as well as 
communication strains which result from such 
foul organization and failure of delegation 
with which we are concerned. 

An equally ridiculous example is that a major 
military command now is in the process of put- 
ting all. their personnel records for the last ten 
years on punch cards and sending them to 
Washington. Washington may be able to proc- 
ess punch cards very fast, but any manager 
who has had to match man to job and stay on 
top of the personnel matters of his employees 
can imagine the reduction to primitive levels 
that managers on the scene will be forced to 
employ and the stratagems that will be used to 
keep Washington permanently misinformed. 

To delegate operational tasks to men on the 
scene calls for attention to the kind of tasks 
which are likely to occur, the kinds of control 
which must be maintained at higher headquar- 
ters and strategic considerations. These are the 
outside aspects of command and control. But 
once these are properly assessed, the inside 
principle must be followed : organize and dele- 
gate to minimize the volume and maximize the 
relevance of communication. And the key to 
this principle is to assign responsibility and 
prerogatives to the man on the scene. 


To Stay Consistent, Stay Close to Reality 

One of the greatest deteriorating factors in 
any organization is incipient conviction on the 
part of a constituent that others with whom 
they must deal are out of touch with the reali- 
ties of the situation. If the higher military 
headquarters appears to be functioning on the 
basis of assumptions that the field knows by its 
direct observation to be false, or even if the 
field only believes them to be false, the field will 
inevitably respond by subverting the informa- 
tion structure to its own ends. If the higher 
headquarters loses confidence in the meaning- 
fulness of the reports it receives from the field, 
it will ignore those reports in making its de- 
cisions, The result is rapid collapse of com- 
munication, a collapse that cannot be detected 
in bit rates. The problem of maintaining touch 
with reality is greater now than ever before, 
for we have so few bench marks on which to 
lay our conceptions of military operations. We 
had not had operational experience under full 
scale war time conditions with weapon systems 
and unit organizations with which we are now 
equipped. There is no underlying body of ex- 
perience that, among other things, assures the 
consistency of view that is so necessary to main- 
tain the cohesion of the military organization. 
Thus we must insist with uncommon concern 
and tenacity that the operational realities of 
which we are cognizant are kept firmly in mind, 
let the chips fall where they may. 

In many cases, the knowledge of our senior 
military officers concerning what is actually 
going on in their commands is simply inac- 
curate or misinformed. It is generally accepted 
that the SAGE command and control system 
led the way in the development of modern 
automated command control techniques; that 
SAGE was the great prototype of automated 
command and control. The indications are, 
however, that many SAGE sites are not ex- 
ercising their equipments from one inspection 
to the next, that some have even put cellophane 
overlays over their scopes and returned to 
essentially manual operation. What confidence 
can we put in the development of future com- 
mand and control systems when SAGE is 
named as the great prototype ? 

But it is not only that our higher headquar- 
ters are net looking. Of much greater concern 
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is that we are ignoring at all levels realities all 
too apparent to the analyst who will but un- 
flinchingly lay out the known facts. How many 
systems, organizations, bureaus and military 
commands live on when both they and their 
missions are obsolete? What is the degree of 
collusion in the conduct of military exercises? 
What studies have been buried? Who has not 
felt too often the pressure to justify rather 
than establish, and witnessed too many con- 
trived demonsrations? 

We are not looking; we are not probing be- 
low the surface. But we have found a substi- 
tute: the operations analysis that purports to 
‘‘simulate'’ military operations in a computer 
or establish formulas for determining the cost 
effectiveness of weapon systems. Operations 
analysis can be useful, but only if soundly estab- 
lished on adequately developed experimental 
evidence from the field. 

It is important that those who are applying 
the results of such analyses at the strategic level 
understand our technical deficiencies in the 
basic and crucial assumptions of these studies. 
A few specific examples of such deficiencies are : 

• there is no adequate method of discounting 
over the long haul in cost/eflfectiveness 
studies ; 

• economic comparisons or economic re- 
source cost estimates involving the Soviet 
Union depend on an adequate analysis of 
the ruble/ dollar ratio, which has proved 
intractable to date ; 

• the “soft" effects of nuclear weapons on 
population, communications and organi- 
zation appear to dominate the “hard" 
effects, yet we have no adequate means be- 
yond intuition to project these “soft" 
effects ; 

• there is no adequate method to assess the 
effectiveness or the effects of deterioration 
of combat communications of any of the 
military or diplomatic services. 

Yet these deficiencies lie across the very roots 
of every major operations analysis today. 

Eight years ago I participated in a study to 
determine whether the results of a certain class 
of effectiveness studies were sensitive to the 


underlying parameters of the studies. We 
found that the degree of sensitivity, relative to 
our knowledge of the parameters, reduced the 
significance of the results to a point where their 
usefulness was in grave question. I recently 
examined this same area to see what improve- 
ments had been made. I not only found the same 
practices, but the same reports which I had pre- 
sumably established as inadequate were quoted 
as the basis for current studies. 

Is it not the case that our studies grow from 
roots that penetrate only as far as the intuition 
of the analyst? One of the best known and re- 
spected operations analysts once showed me his 
notebook where he wrote down prior to under- 
taking a study what results he expected to 
prove. He was very proud that he had never 
failed to predict his results correctly. If I were 
to be asked to choose between the expertise of 
a Ph.D. mathematician and a flag officer of our 
armed forces in matters of mathematics, I 
would pick the mathematician. As a mathe- 
matician, I do not have the arrogance to hold 
that the mathematician's “guesstimates" and 
intuition should hold sway in the arena of mili- 
tary operations. 

I am not here decrying lack of objectivity as 
such, or pointing in alarm at malpractice. What 
I am saying is that if we are to reverse the 
catastrophic fractionization of context which is 
occurring in the Department of Defense, we 
must insure by all available means that con- 
sistency which can legitimately be found. The 
best source of consistency is reality itself. We 
must get out of our nonprofit ivory towers, ad- 
ministrative labyrinths, and self-mesmerizing 
brochures. We must insure that our context 
matches the reality that is there, if we bother 
to look. Experience must again be valued, the 
man on the scene respected, and the hunch 
heeded that tells us we had better drop in un- 
expectedly to get a first-hand look. 

The Importance of Purpose 

Modern communication theory teaches us 
that information can be measured in terms of 
our relative uncertainty concerning a set of 
alternatives. What it does not tell us is where 
these alternatives come from in the first place. 
Once the alternatives are clearly defined, then 
that part of a communication which does not 
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properly reduce our uncertainty is identified as 
noise. However, if we were to change our selec- 
tion of alternatives, what was once noise may 
become highly informative communication. 
Thus the selection of alternatives is a crucial 
aspect of obtaining information. 

However, it is in the selection of alternatives 
that we often go astray. It is far too easy, when 
we come across a new facet of a situation, to 
either make simplifying assumptions related to 
the techniques of the analysis rather than the 
problem, or to simply ramify the alternatives, 
establishing new requirements for data. In 
either case, communication theory gives us 
neither reason nor warning for the resulting 
dilution of relevance. We pay homage to sys- 
tematic methods, and, like the physicist who 
ignores viscosity in favor of Newton's laws, 
forget about the sticky aspects. We must be 
sure to get all the hard facts; to distrust — in- 
deed fear — the man who is outspokenly biased. 
We dare not make judgements of relevance, and 
the only limitations on alternatives are those 
made in favor of allowing more precise analysis. 
And as a result, we are faced with the paradox 
that since there is nothing we dare call noise, 
it is all noise. I know of one office of the De- 
fense Department that has stored away over 
7000 magnetic tapes, each containing a million 
and a half words. They never have and never 
will have the capability to look at them, yet they 
are reluctant to throw them away. 

The selection of alternatives is where our 
purpose is revealed, where our measures of 
relevance are established. It is only when there 
is clear purpose that there exists the means for 
establishing relevance. And it is only through 
a sense of relevance that we can winnow out 
the essential elements of information from the 
swirling, encompassing chaff of data and re- 
ports. When we have our purpose clearly in 
mind, then the one or two alternatives that we 
actually have stand out clearly in front of us. 

What is purpose? Purpose is not something 
set down in writing under the subhead, objec- 
tives; nor is it the first few paragraphs of a 
justification paper. I imagine that each of us 
has on occasion sat with tablet and pencil before 
us trying to put into a position paper or pro- 
posal a meaningful statement of purpose, and 


finding our words empty and unconvincing. 
Purpose is not expressed that way. 

Purpose is only truly meaningful to the man 
on the scene who has at his command the means 
of accomplishing something and is caught up, 
engrossed, in getting a task done. Such a man 
is not objective ; indeed, he is vehemently biased. 
But this is the kind of bias we so desperately 
need. He ignores most of the facts, and in so 
doing has time to attend to his job. He is too 
preoccupied to fill in all the data forms; but 
when he communicates, the message both de- 
mands attention and clarifies by its clear 
relevance. 

Purpose is the man on the scene, struggling 
with a task, who can get direct feedback from 
his actions, who has at hand the measure of his 
accomplishment or failure. I do not think you 
can separate the two. Separate the man from 
the means for accomplishing his task, and pur- 
pose goes also. Remove him from the scene, 
from the source of direct feedback, and you re- 
move purpose as well. 

In designing information systems for com- 
mand and control there seems to be no atten- 
tion given to conveying purpose. How is pur- 
pose conveyed? Purpose is conveyed by saying 
what the alternatives are, not by evaluating 
their cost effectiveness. Purpose is conveyed by 
saying what the measures of effectiveness will 
be. Purpose is conveyed by asking questions, 
not for the sake of the answer, but to be sure 
the man questioned got the gist of your order. 
Purpose is conveyed by leadership and follow 
through. 

The systemization of human knowledge has 
grown to such an extent that we find ourselves 
entrapped in a great web in which it seems that 
any strand we pull ties in with all the others 
tugging us in ways we did not anticipate. We 
tend to replace the question as to which way we 
wish to go by the question as to which way we 
can go. And we assume this latter question can 
be solved by simply getting our facts untangled, 
tracing through the scientific web of cause and 
effect, and thus clarifying the old alternatives 
and their cost effectiveness. 

The realities with which the military must 
deal today are beyond such scientific explana- 
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tion, I am not talking about the ‘‘intangibles/’ 
nor about the role of “judgement/’ nor any of 
the other cliches used to justify intuitive be- 
havior. The self-acceleration of context, in par- 
ticular of our scientific/military context, has 
reached the point where changes are taking 
place in the system faster than our ability to 
properly take them into account. I have stressed 
above that fractionization is occurring, and the 
essence of fractionization is that overall 
rationalization of the total system becomes 
more and more impossible. 

If science is inadequate for the direction of 
our military programs, by what should we sup- 
plement science? Under these conditions of 
fractionating context and decay of overall in- 
formativeness we must find some means to re- 
establish relevance and spur and inspire pur- 
poseful action with a unity of direction. Our 
current attempt to gather all possible informa- 
tion into one place and then use scientific 
methods to reduce it to a point where decisions 
can be made is resulting in just the opposite of 
what is required. 

Indeed, the very cry of our operations 
analysts for clearer statement of requirements 
points up the almost total loss of measures of 
relevance and sense of purpose. 

Let us recognize that some men have a better 
sense than others for where we are going and 
what can be done about it ; and that some men 
have the capability to inspire and lead. Just as 
we look for the finest intellects to become 
scientists, we should similarly look for and 
esteem our men of judgement and our men of 
leadership. How much money are we spending 
today to encourage those who rate high on the 
college board examination compared to the in- 
centive we are giving to courage of conviction, 
personal initiative, and the qualities of leader- 
ship ? We do not breed leadership by centraliza- 
tion of decision making, by defiling the image 
of those who have achieved prominence as 
leaders and men of judgement by casually re- 
versing their decisions, by giving precedence to 
the mathematically encapsulated intuition of 
men whose judgement has never been tested in 
operational situations, by occupying the time of 
our field officers with routine paperwork and 
the exclusion of honest tests of their effective- 


ness in successful operation and sustained oper- 
ational preparedness. 

It is in the practices of command and con- 
trol that we determine the quantity and the 
quality of judgement and leadership that we 
shall be able to call upon in the future. It is the 
practices of command and control that allow 
the quantity and quality of judgement and 
leadership we now have to be exercised effec- 
tively. And it is the quantity and the quality of 
judgement and leadership that will determine 
whether the exploding context will fractionate 
in an orderly fashion, giving us the defensive 
capability that will sustain our great country, 
or will hasten the catastrophic fractionization 
that is now rotting our defensive capability 
from within. 

To summarize: 

• the requirements for communication must 
be minimized and the relevance of com- 
munication must be maximized by placing 
the prerogatives of command and the re- 
sponsibility for reporting upward in the 
hands of the responsible official on the 
scene ; 

• the interconsistency of context must be 
maintained by keeping close to reality, by 
delegation to the point where feedback 
from the real world can be felt, by follow 
through in insuring that things are as they 
are reported to be, and by facing up to 
realities that we know but are now ignor- 
ing; 

• relevance and purpose must be stimulated 
by attention and esteem for judgement and 
leadership and by the stimulation of these 
qualities by allowing them to be exercised 
at all levels of command. 

These three principles are the central issues 
of command and control today. Our attention to 
them will mark the success or failure of the 
establishment of effective command and control 
across our military organizations. Do not be 
misled that the only problems that remain are 
how to automate decision making, store and 
retrieve documents or harden communication 
centers. Let us turn now to these central issues 
and what our industry can do to resolve them. 
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IMPLICATIONS FOR THE COMPUTER 
INDUSTRY 

Exploitation of the Adaptability of the 
Computer 

The development of the digital computer has 
provided us with a machine whose combination 
of adaptability and computational power is way 
beyond anything previously available. As a 
means of manipulating great quantities of data 
and carrying out extremely complex data proc- 
essing, the computer opens whole new avenues 
by which we can bring to bear distant observa- 
tions on current concerns. Furthermore, the 
high speed, stored program computer has the 
potential of placing these capabilities at our 
fingertips. 

However, the exploitation of these two great 
virtues of the computer cannot be carried out 
independently. There is a trade-off between the 
computer as an adaptable instrument and the 
computer as a data processing instrument. To 
make the most efficient use of the computer as 
a data processing instrument requires efficient 
programming, minimizing of housekeeping and 
input-output functions, and careful scheduling 
of the computer's utilization. To make the most 
efficient use of the adaptability of the computer 
requires, in contrast, that the computer respond 
directly to the sometimes ill thought out instruc- 
tion of the user, with much time spent on input 
and output to deal with the elaborate re- 
dundancies and conventions by which we nor- 
mally communicate, and that the computer es- 
sentially wait for the user rather than the other 
way around. 

Now the principal uses of the computer today 
are in applications where the maximization of 
its data processing capabilities are required. It 
is indicative to recall that the biggest sources 
of funds by far for the development of com- 
puters have been the Atomic Energy Commis- 
sion and the data handling aspects of military 
intelligence. In both cases the emphasis is al- 
most entirely on speed of processing. Even the 
output required is in the concise and sophisti- 
cated language of mathematics. No wonder the 
current generation of programmers and com- 
puter engineers place efficiency of computation 
and efficient use of the computer ahead of all 


other measures in deciding the appropriate way 
to apply computers. 

Further, when the problem that the computer 
programmers and computer engineers see are 
so very large and complex and so distant from 
the underlying uncertainties giving rise to the 
computation, the idea that the data are suspect 
and the answer only an indication of what new 
question to ask, immediately requiring a new 
program, or second guess at inputs, is foreign, 
indeed. Thus the adaptability of the computer, 
which makes it such an ideal tool of conceptual 
exploration, has been downgraded. Little has 
been done until recently toward exploiting the 
other half of the computer's great capabilities. 
Indeed, relative to total expenditures or total 
population in our industry, little is being done 
today. 

In particular, our command and control sys- 
tems are not responsive to the rapidly changing 
environment, capabilities and objectives of the 
user. In fact, the principal result of all the push 
in command and control to date is the prolifer- 
ation of organizations inserted between the ulti- 
mate user and his computer. There is the EDP 
office of the using command, his counterpart in 
the procuring command, the system monitors, 
the contractor's project office and finally the 
programming and design people. It is in the 
interest of each of these offices to seal the sys- 
tem requirements years before its operational 
date and see to it that the most efficient — and 
thus necessarily the most immutable — configu- 
ration of programs and equipment are provided. 
For example, soon after the initiation of one of 
the Air Force L systems, it became evident that 
the requirements stated in the request for pro- 
posal (by then 15 months old) did not take 
cognizance of the user's trends and plans for 
reorganization. The system contractor duly 
recommended a rather complete reorientation 
in his effort. Not only was this recommendation 
ignored, but 10 months later, 6 months after a 
reorganization of the user that made the work 
statement obsolete, the system contractor was 
required to deliver a detailed systems design and 
establish by detailed analysis that it filled, but 
did not go beyond, obsolete requirements of a 
nonexistent organization. 
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In the first part of this talk, I stress the grow- 
ing problem of catastrophic fractionization of 
the context of the Department of Defense. I am 
convinced that a by no means insignificant con- 
tribution to this impending catastrophe is being 
made by past and current efforts in command 
and control systems. The huge, inflexible data 
systems with their insatiable demands on the 
field for irrelevant, often unused data, the cum- 
bersome automation of decision processes 
whose development time cannot keep pace with 
changing requirements are contributing not 
only to the inflexibility of our defense posture, 
but immeasurably contributing to the incon- 
sistencies they are supposedly resolving. There 
is nothing easier to corrupt than a complex, 
highly formalized data system. And there is 
nothing for which the coercions to corruption 
are stronger. I shall never forget the conversa- 
tion I had some years ago with one of the lead- 
ing contributors in our industry. At the time 
he was at a Naval shipyard, in charge of their 
computer installation. He^ described the prob- 
lem, there as having nothing to do with efficient 
methods of inventory control or production 
scheduling. Rather their then current data sys- 
tem had been so completely corrupted at the 
grass roots level that one could place no reliabil- 
ity on any available data, and as a consequence, 
the yard management was in a very real sense 
helpless in face of the working level control of 
shop foremen. How much farther has this gone 
today? 

The real damage, though, is not being done by 
the command and control systems that are 
working; there are too few of these. It is the 
great promises and strange delusions that in a 
centralized data system we have the panacea of 
all our command and control problems. Mr. Gil- 
patric, when Assistant Secretary of Defense, 
said, “The machinery for gathering, analyzing, 
and presenting the data necessary for decision- 
making has, due largely to the extensive and 
imaginative use of automatic data processing by 
the military, advanced to a point where central- 
ized decision-making is both efficient and effec- 
tive.’’ Mr. Esterly Page, Director, National Mili- 
tary Command System (Technical Support), 
repeating the above quote from Gilpatric, went 
on to elaborate on the choice that has been 
made: “During the past two years this nation 


has undergone a basic change in the philosophy 
of command and control which has resulted in 
new emphasis on the requirement for large 
quantities of finite and precise information to 
form the basis for consideration at the highest 
level of problems that previously were con- 
sidered and unfortunately decided at a much 
lower one.” Like the wooden walls of Athens, 
computers may be the key to our survival. But 
what happens when those high up in the lonely 
citadel find they chose the wrong wooden walls, 
and the ones they chose have been rotted away 
by the proper corruption which seeps in when 
the traditional coercions for responsibility have 
been taken away? 

However, there is a bright spot in this other- 
wise disturbing picture. The Advanced Re- 
search Projects Agency is supporting a number 
of well-chosen programs for the development 
and exploitation of the adaptive capabilities of 
computers. Certainly one of the most ambitious 
of these programs is Project MAC, undertaken 
by the Massachusetts Institute of Technology. 
The Air Force is also turning toward develop- 
ments along somewhat similar lines, as was 
stated by General Terhune^ at the Fall Joint 
Computer Conference in Las Vegas.* There 
are, as well, programs sponsored by other 
agencies, both public and private. For example, 
the commercially supported project undertaken 
by the Control Data Corporation concerning 
computer supported engineering design. 

These projects have as their general objec- 
tive making the computer directly available to 
the individual researcher, analyst, and staff 
officer, as an adjunct to his creative thinking. 
The technical objectives are threefold. First, 
by means of remote consoles, interrupt features, 
and associated software, to provide a number 
of users direct and parallel access to the com- 
puter. Second, by means of problem oriented 
languages, including graphic and natural lan- 
guages, to provide access to and response from 
the computer in a form that is natural and effi- 

* This point of view was anticipated by ^‘A Concept 
for the Navy Operational Control Complex of the 
Future,” by R. W. Callan and F. B. Thompson,! and 
‘‘Fundamentals Underlying Military Information Sys- 
tems Design,” by F. B. Thompson, a paper presented 
at the First Congress on the Information Systems Sci- 
ences, November 20, 1962.2 
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dent for the user. Third, by organization and 
advanced concepts of software logic, to pro- 
vide more and more intelligent responses to an 
increasing variety of possible requests. These 
capabilities will take the computer from down 
the hall and behind the counter, and place it 
as a direct adjunct of the staff officer. 

Command and Control Implications of Direct 
Access Systems 

The general nature of these technological de- 
velopments is probably known to you. What I 
should like to bring into sharp focus here is the 
implications these technological developments 
hold for the central issues of command and con- 
trol. A computer as a direct adjunct of the staff 
officer is a very different thing than a computer 
as the central instrument of a great data sys- 
tem. Those who speak of direct access systems 
often describe how they will allow the staff offi- 
cer to make use of some central data base, and 
the implication seems to be that the staff officer 
will have direct access to the great assemblage 
of fact that now stands assembled but inacces- 
sible by the great data system. This is a false 
view. Access to a base of fact can be meaning- 
fully accomplished only if the user has a feel for 
the relevance, the source, and the units of those 
facts. In a very real sense, they must be his 
facts, his data. We too often assume that the 
number of missiles in Cuba, or the number of 
atomic weapons we could deliver to enemy tar- 
gets are sufficiently well defined when in fact 
their uncritical application by the uninitiated 
is fraught with danger. 

This point needs to be stressed. Meaningful 
data is a fragile, subjective thing. The intelli- 
gent analyst knows this when he insists on 
knowing the use that is going to be made of his 
estimate before he is able to produce it. The 
contracting officer knows this when he refuses 
to allow even a unilateral passage of informa- 
tion during deliberations on a source selection. 
The combat commander knows this when he 
gets out among the troops to sense the situation 
on the spot. One can reasonably and respon- 
sibly use data only if it is one's own data. 

There is a second aspect of this matter of the 
data base underlying these direct access sys- 
tems. The amount of data that could be ac- 


cumulated in, say, a major command head- 
quarters is astronomical. The choice of just 
what data is to be found in the system thus be- 
comes an important command decision in itself. 
That choice embodies the sense of relevance of 
the headquarters. Further, the implementation 
of that choice by directives on subordinates and 
requests to lateral and higher headquarters, is 
a principal means of conveying purpose and 
establishing direction. The headquarters whose 
reporting requirements are static and all- 
inclusive gives up one of its principal tools of 
control and opens the door to subversion of its 
leadership. It is the direct access system that 
can give new vitality to data as a sensitive in- 
strument of command and control. 

Therefore, the implication of the technologi- 
cal development of direct access systems is that 
there will be closely knit staffs addressing rela- 
tively small but dynamic bases of highly rele- 
vant data as they deal intensively and purposely 
with problems local to their scene. Such direct 
access systems cannot be reasonably justified 
on any other grounds. 

Thus the technological development of the 
computer as an adaptable instrument, respond- 
ing directly to the requirements and local con- 
text of the user marks the beginning of the 
counter trend. Not only are the continuing 
failures of the automated information systems 
becoming more and more obvious, but the seeds 
of a new . type of system suited to the needs of 
the purposeful headquarters on the scene are 
being sown. The catastrophic fractionization 
of the Defense Department is being accelerated 
by the obsolete centralized information systems. 
It is the latter, the system that can be used re- 
sponsively by the purposeful staff, that can 
bring the necessary vitality and responsiveness 
to orderly fractionization. 

The Promise and the Needs of the Highly 
Responsive Staff 

Throughout this paper, the underlying prob- 
lem has been how to deal with the exploding 
military context. We have emphasized (1) the 
need to place the prerogatives of command and 
responsibility for reporting upward in the 
hands of the responsible officer on the scene; 
(2) the need to maintain a direct hold of 
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reality; (3) the need to establish purpose and 
measures of relevance through leadership. How 
can this be done, say in the environment of mili- 
tary command? 

The essence of orderly fractionization is that 
the units of command must be in a position to 
maintain a viable context, a cohesive and con- 
sistent view that integrates their capabilities, 
environment and objectives in a way that is 
responsive to the realities with which they 
must deal. The commander and his staff must 
know what he wants to do, what it will take to 
do it and what he has to do it with, and they 
must have a sense of readiness that his objec- 
tives, his requirements and his capabilities add 
up with possibly a margin of capability in re- 
serve. No one of these three ingredients of his 
context — objectives, situation and capabilities 
— is given to him immutable. He must be able 
to modify his detailed objectives, reassess his 
situation, reorganize and redeploy his capabil- 
ities. That is, he must maintain the consistency 
of his context of command. But more than that, 
he must make sure of this intelligence, test and 
exercise his forces, prod and react to his supe- 
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text of command. This maintenance of context 
is the essence of staff work and the foundation 
of command and control. The orderly frac- 
tionization of command is tantamount to the 
establishment and reestablishment of a hier- 
archy of command in which each headquarters 
can maintain such a context peculiar and ap- 
propriate to its position in the hierarchy. 

As the rate of expansion of overall context in- 
creases, this hierarchy of command must ex- 
pand and deepen. Each element in turn must 
increase the efficiency and maintain the rele- 
vancy of its context. If this is done there will 
indeed be fractionization, for the contexts of 
forward elements, in touch with the rapidly 
shifting local situations, will responsively shift 
and move in ways the contexts of higher ele- 
ments cannot moment by moment respond. 
These higher elements, however, operating at 
higher levels of abstraction, will follow at their 
appropriate rate, their appropriate realities, of 
their appropriate scene. Fractionization of the 
overall context inevitably will occur. 


Let me restate that in the bluntest possible 
terms : In the Department of Defense today, in- 
consistencies between the contexts of the vari- 
ous elements of the organization are inevitable. 
It is the character and source of these inconsist- 
encies that must be controlled. If that frac- 
tionization is realized in terms of the responsive- 
ness of the disparate elements of the command 
hierarchy, it will accommodate greater and 
greater responsiveness to change, and permit an 
orderly control of a vast and vastly informed 
organization. Consistency of overall context 
cannot be maintained in the face of change. But 
orderly fractionization of context that places 
the prerogatives of control at the point where 
the sense of change is most meaningful, that in- 
sures the degrees of change will be felt at the 
appropriate level and that inserts purpose into 
change, such orderly fractionization can permit 
our organizations to live and grow and prosper. 

In Summary 

Thus the crucial informational problem is 
the maintenance of relevant, consistent, valid 
context in each separate, disparate organization 
or military headquarters. This is the central 
task of staff work, the essence of command and 
control. It is in the perspective of this task that 
our information systems must be planned and 
developed. The appropriate information sys- 
tem must be a direct tool of staff, as natural and 
direct to use as the telephone. The data held 
by the system must be put there by the staff, 
must be the intimate changing product of the 
staff, the factual base of their context whose 
relevance is their most crucial task. And that 
system must be the staff's system, not a uni- 
versal system, not a standard system, but one 
that is recognized as what it is, inconsistent 
with all others, to a degree irrelevant to all 
others, and thus free to be responsive to the 
sense of relevance of its staff, the purpose of its 
commander or administrator. From such a con- 
text, communication will be relevant. And this 
communication, by its very shifts and changing 
emphasis, will convey purpose and direction as 
well as fact. 

This is a very different kind of system than 
the great data systems and automated control 
systems. It is a system that allows direct access 
in natural language, it is a system whose files 
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and formats, whose inputs and vocabulary are 
always changing, continuously changing. 

The Department of Defense is tending to- 
ward catastrophic fractionization. The reversal 
of the process of this explosion may still be pos- 
sible. It rests in the hands of two groups: 
First, those in control of defense policy, in par- 
ticular those who are determining the policies 
for military command and control. Second, it 
rests on us, for it is our industry that can turn 
to the development of systems that will exploit 
the adaptability of computers and will place 
them as the tools of staff in maintaining rele- 
vant, responsive contexts of command. The 
challenge to realize orderly fractionization is 
very real and urgent today, a challenge to which 
we must respond. 
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SOME OBSERVATIONS CONCERNING LARGE 
PROGRAMMING EFFORTS 


Almon E, Daniels 
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INTRODUCTION 

The requirement to construct large systems 
involving both men and computers to assist in 
the evaluational and decision making processes 
of command and control leads very quickly to 
the development of large and complex computer 
programs. While my own efforts have been 
largely devoted to the development of detailed 
computer simulations of strategic air war- 
fare/ I have had an opportunity to follow in 
more or less detail the progress on a few of 
the command and control systems. The observa- 
tions which are made here relate to a broad 
class of computer data systems and certainly 
apply to the ones evolving in the command and 
control area. 

One may think of a computer program as a 
progression of smaller programs or routines. 
Among these routines are ones which alter data 
presented to them, perform calculations, make 
new arrangements or combinations of the data, 
provide for employing alternate paths in the 
program, etc. Of considerable importance also 
are the control routines which bring together 
at the proper time the correct routine with the 
data on which it is to operate. In more com- 
plex programs, the sequence of use of the 
routines and data is subject to modification by 
external events. 

The tougher problems in the design of com- 
puter programs begin to arise when the in- 


ternal memory of the computer is too small to 
contain all of the routines and data involved. 
For the large man-machine systems the internal 
memory might be likened in size to a teacup 
used to dip a mixture of program and data 
from a barrel. Much larger memories will 
simplify some of the problems, but they will 
not eliminate nearly all of them. 

These remarks have been made to clarify, 
for the present purposes, the meaning of the ex- 
pression ‘‘large computer program.” 

The development of a large system goes 
through several phases. First, there is a for- 
mulation of system requirements. Then fol- 
lows a period of system analysis, problem defi- 
nition, and design. The computer hardware is 
selected, a detailed design is made and im- 
plementation is undertaken with numerous 
changes and delays. Experimental applications 
are made, redesign occurs, and modifications of 
the computer program are made. Further 
trials are made, and the cycle is repeated. 

The elapsed time for the accomplishment of 
these phases is measured in years. The design 
and implementation of most of the Air Force 
“Big L” systems have extended for three to 
five years. The other services have faced similar 
schedules. Under very favorable circumstances, 
a substantial application can be brought to a 
productive state in about two years. 
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DESIGNERS' DILEMMA 

Designers of large data systems begin at 
condition A and envision a solution B, hurdling 
a number of intermediate steps. Because of 
the anticipated elapsed time of three to five 
years, embarking on the task of attaining 
solution B without intermediate steps requires 
the acceptance of a fairly bold assumption. It 
has to be assumed that solution B, which has 
been arrived at in terms of present day prob- 
lems, will in fact be suitable for the intended 
task when the system is operational. In other 
words, the assumption must be made that the 
problem and the solution will remain relatively 
constant. Experience has shown that this 
happy state does not exist. The more distant 
the goal, the less certain the solution becomes. 
Furthermore, because the problem exists at 
condition A, there is pressure to arrive at a 
useable solution as soon as practicable. Con- 
sequently, as work progresses, several compro- 
mises are made. Call these less elegant solu- 
tions in succession Bi, B2, and B3. As time 
passes, additional knowledge is acquired, and 
the actual requirements of the problem change 
to B' while the compromise solution is still B3. 
A person not troubled by the esthetics of pure 
design could set out toward the not-crystal- 
clear goal B by making the step from A to A' 
(which may be rather close to B3) with much 
greater chance of early accomplishment of the 
lesser goal. 

Our experience with computer applications 
verifies the comment of one keen observer in 
the profession: ‘‘When we cease to change a 
program, we cease to use it." 

If there is any conclusion to be drawn from 
considering these two ideas together, it is the 
following: Make as complete a system study 
as possible, but design the implementation in 


such a way that a portion of the system can be 
exercised usefully at an early date. One must 
attempt to build the system and its modifica- 
tions so that the user can operate the system 
while its usefulness is being extended incre- 
mentally. It is almost inevitable that exhibit- 
ing sample products of the system will lead to 
changing the design of the system; and the 
sooner this restatement of the requirement 
occurs, the sooner the first round of modifica- 
tions to the system can be undertaken. Changes 
will be required on a continuing basis as the 
users of the system better understand what the 
system can do for them. 

USER PARTICIPATION 

It is next to impossible for the system 
analysis team to discover the full extent of the 
system being studied or to find all the boundary 
conditions unless the cooperating representa- 
tives of the user are sufficiently experienced in 
both the operational and computer aspects of 
the application to exhibit all the pertinent 
facets of the problem. The distinction is here 
made between the ultimate user of the system 
and his representatives who are assigned to 
assist with the design and incremental imple- 
mentation of the system. The backgrounds and 
perspectives of the representatives may result 
in the incorporation of some undesirable 
features in the system, if they are not able to 
project their thinking to the needs of the 
ultimate user. If the ultimate user can be in- 
volved in the early exercising of the incom- 
plete system, clarification as to emphasis and 
detail should occur before the whole design is 
implemented in an unsatisfactory fashion. 

Despite the possibility that guidance from 
representatives of the ultimate user may in- 
troduce some ideas into the design which will 
have to be modified at a later date, the partici- 
pation of carefully selected representatives of 
the user throughout the design, implementa- 
tion, and testing of the data system is vital. 

Without such participation, the user's organi- 
zation will be ill-prepared to understand the 
series of compromises which have been made 
to bring the system to an operational state, and 
will not understand that asking for too much 
in a hurry is likely to delay the completion of 
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the initial portion of the system and its in- 
cremental extensions. 

The user's personnel must be able to under- 
take the maintenance and modification of the 
system after the team from outside the organi- 
zation has completed its work. In order to 
supply this capability, the user organization 
must provide sufficient personnel of suitable 
talent to participate in the implementation of 
the system. These participants must be deeply 
involved in translating the design to language 
acceptable to the computer, debugging the 
routines created, developing test problems of 
proper sophistication, producing the docu- 
mentation, etc. As the key personnel assigned 
this duty may require two years or more to 
become proficient in substantial portions of the 
system, care must be taken to assure continuity 
in their assignments. Until the problem of 
rapid turnover among .computer programmers 
becomes less acute, it is likely that most of 
the more stable personnel involved will be 
among those in the employ of the using organi- 
zation. 

TESTING 

For computer programs of considerable size, 
it is useful to distinguish three levels of testing. 
The programmer devises a test or tests for 
each subroutine he builds in order to assure 
that the computer reacts as anticipated to all 
the circumstances he considered in designing 
the subroutine. 

As these building blocks are completed, they 
are ready for testing in combination with other 
subroutines and other aspects of their intended 
environment. Their input and output conven- 
tions are checked for compatibility. Larger 
test procedures are devised to exercise the 
joint functions of the program complex. Ap- 
propriate adjustments are made in the pro- 
gram until all anticipated results are pro- 
duced. 

The third stage of program testing^ con- 
sists of putting the program into limited op- 
eration to observe its behavior when dealing 
with the many variations of data which arise 
in practice. It is very difficult to devise test 
problems which employ all the possible paths 


in a program. Even though the program re- 
sponds correctly in a limited test, it is possible 
after several weeks of successful operation for 
the program to encounter some new combina- 
tion of inputs against which it was not ade- 
quately protected. 

DOCUMENTATION 

Documentation of the system created in one 
of these three-to-five-year projects is an im- 
portant though time-consuming endeavor. The 
description of the program in the language ac- 
cepted by the computer is a key part of this 
record and must be kept up to date as program 
changes are made. This level of documentation 
is the “court of last resort" in isolating pre- 
cisely what some routine does, but the technical 
remarks or diary prepared by the programmer 
while he has all the facts in mind will reduce the 
amount of detective work which must be done 
some months later when an unanticipated 
change must be made. Two or three other 
descriptions of the system may also be re- 
quired at decreasing levels of detail. 

Adequate documentation is important to the 
person who created the computer program, as 
even he soon finds the maze of detail confusing 
without rather complete records. The person 
who creates one program is never free of it 
unless he makes it possible for others to modify 
his program without performing major re- 
search. The documentation must be generated 
so that a given programmer may be assigned 
to a different project. This documentation is 
essential, for the protection of the user, as the 
programmer may at some critical time no longer 
be available for consultation. 

SIMULATION 

Simple exercising of a large man-machine 
data system with the use of a single bank of 
information in a fixed pattern will bring into 
focus the utility of such a system to aid mili- 
tary decision makers only under rather limited 
circumstances. A simulation technique should 
be superimposed to alter the data flow accord- 
ing to various assumed alliances, postures, 
strategies, or the like. Such simulation exer- 
cises of the system in the operational environ- 
ment by users of appropriate echelons will give 
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them synthetic experience about situations 
which have not been encountered in sufficient 
detail in any other medium. Only by exercising 
the system with data flowing at an accelerated 
pace will many of its weaknesses be discovered. 
The simulation must, of course, incorporate 
uncertainty, conflicting information, degraded 
communications, etc., if realism is to be ob- 
tained. 

Considerable effort has already been devoted 
to the development of detailed computer simu- 
lations of the air interactions in a nuclear ex- 
change.^ It would be hoped that these simula- 
tions might be used to generate under various 
assumptions much of the information required 
to alter the behavior of the man-machine data 
system. 

It should not be assumed that providing the 
simulated environment to influence the large 
data system is a small undertaking. Depending 
upon the detail introduced, the project may be 
even larger than the original one. 

Thus far this discussion has dealt with the 
notions of system design, incremental imple- 
mentation, inevitability of modifications, user 
participation throughout the project, testing at 
three levels, integrating documentation with 
the development of the program, and the use- 
fulness of providing a simulation environment 
for the data system. 

It also seems appropriate to record some addi- 
tional remarks based on participation in or 
observation of several large programming 
efforts. These remarks will range over the 
topics: level of effort, estimation of time re- 
quired, hardware considerations, reaction to 
frustration, and programming systems. While 
these topics are not clearly related, the com- 
ments are intended to provide at least a partial 
check list for those who are brave enough to 
undertake the development of a large computer 
program. 

LEVEL OF EFFORT 

It requires great skill and unusual circum- 
stances for a large system of programs to ar- 
rive at a satisfactory operational state in less 
than three years. Even though the period since 
1955 has provided several instances of this bit 


of truth, every new group which embarks upon 
a new project is dominated by persons who be- 
lieve that they can avoid the mistakes made on 
other projects and shorten their own schedule. 
To some extent they do profit from the experi- 
ence of others, but they usually make a few new 
mistakes of their own. 

It is important to understand that a small 
team of four or five experienced analysts can 
accomplish about as much in the first four 
months as a team of twenty. Toward the end 
of the initial phase, the small team should be 
supplemented by specialists in various areas as 
required. There will be no advantage in gather- 
ing a team of 100 technicians at the beginning 
of the effort. As a matter of fact, such an action 
will probably delay the project. 

A healthy growth in the effort applied to the 
project is important. A modest beginning with 
thorough indoctrination in the early months 
of the user-contractor relationship will pay off 
in the end. 

Wide fluctuations in the support given the 
contractor will cause serious difficulties. If 
enough money is provided to acquire too large 
an increment of personnel, most of the old hands 
will be devoting too much time to training the 
new arrivals. If too little support is provided, 
progress lags and morale suffers. As new ar- 
rivals on the project will not contribute much 
for several months, the project is relatively in- 
sensitive, in the short run, to small personnel 
increases. 

ESTIMATION OF TIME REQUIRED 

Aside from the scarcity of experienced per- 
sonnel capable of performing the scientific and 
engineering tasks involved in the development 
of a large computer data system, there are some 
administrative problems which lead to fixing 
the minimum time at two years regardless of 
the complexity of the system. Among the ad- 
ministrative problems are the following : budget 
justification and funding release practices, site 
construction schedules, and equipment delivery 
schedules. 

It has been customary in some quarters to 
estimate the productivity of skilled program- 
mers at 3000 instructions per man year. When 
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a great deal of on-the-job training is involved, 
the estimate should be set at about 1000 
instructions per man year. When smaller teams 
are given the responsibility for well defined 
sub-projects, instances of 12,000 to 15,000 in- 
structions per man year have been observed. 

In estimating costs, it appears necessary to 
allow for each programmer two other persons 
classed as analysts, administrators, or engineer- 
ing support personnel. 

The more complex data systems fall in the 
100,000 to 200,000 instruction class.^ Using the 
factors just given, one can begin to estimate 
the manpower required and the cost of im- 
plementing such systems. By imposing limits 
on the size of the team to be involved and 
taking into consideration the funding schedule, 
it is then possible to obtain a first approxima- 
tion for a completion date. 

After the design is relatively complete, the 
users and designers must agree that the design 
meets the requirements of the project. After 
the detailed programming effort begins, a 
major re-orientation of emphasis which re- 
quires a redesign of a major segment of the 
system will normally delay considerably the 
completion of the project. 

Once agreement has been reached on the de- 
sign of a specific large computer program, the 
user must exercise a great deal of patience and 
forbearance. No amount of exhortation will 
materially affect the progress of the project 
and may even cause delays. The user who sched- 
ules other events very closely on the basis of 
the promised completion date of a large pro- 
gramming effort is almost certain to have 
major disappointments. 

Some attempts have been made to apply 
PERT techniques to programming efforts. Un- 
til estimating techniques in the programming 
field are more reliable and the addition of extra 
manpower on lagging subroutines has a more 
predictable effect in the short run, the develop- 
ment of a PERT network will serve only to help 
the managerial echelons to understand the re- 
lationship of the various pieces of the program 
being constructed and to make the initial as- 
signment of effort more intelligently. 


HARDWARE CONSIDERATIONS 

Rapid strides are still being made in com- 
puter hardware development, and significant 
improvements as to speed, reliability, capacity, 
and flexibility are available every two or three 
years. It has been pointed out that the design 
and programming effort for large problems 
takes a minimum of two years. 

It is hard to imagine the confusion which 
would arise if a computer, of not too well de- 
fined characteristics, were being constructed 
for a large problem which was being pro- 
grammed at the same time. First, it would 
require most of the hardware construction 
period to create the sophisticated programming 
system required to permit the large problem 
to be accepted by the computer. Further, about 
half way through the period, there would be a 
requirement to debug some of the routines of 
the large problem. 

As the problem has probably changed mate- 
rially during the programming period and most 
certainly will change as soon as the program 
has been used a few times, most persons who 
have considered these facts have agreed that 
nothing is to be gained by trying to specify 
the characteristics of a large-scale special pur- 
pose computer. If it is built to satisfy the re- 
quirements of a problem as defined now, there 
is a good chance that it will not be adequate 
for the problem as it is understood shortly 
after the computer is completed three years 
from now. 

A useful competition among the computer 
hardware manufacturers is in progress. For 
reasons which need not be considered here, dif- 
ferent types of computers are being selected 
for installation at various locations on th basis 
of current requirements with well-documented 
justifications. Much of what is to be done at 
location A on computer X may also at some 
time have to be done at location B on computer 
Y, etc. If a programming system implemented 
on computers X and Y would, in fact, accept the 
programs first written for the other computer 
with only modest changes, considerable savings 
would result. 
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REACTION TO FRUSTRATION 

Any person who has the tenacity to spend 
several weeks trying independently different 
approaches to the solution of a puzzle before 
success can appreciate in some small measure 
the frustrations of the programming activity 
and the final feeling of personal triumph when 
the last important programming bug is cor- 
rected. As the process of preparing a large pro- 
gram often extends over more than a year and 
the mistakes of many people are involved, the 
frustrations are of a higher order. In the nor- 
mal course of events, nerves become frayed, 
fingers are pointed, neuroses appear, and other 
psychological and physiological reactions are 
not unknown. A few persons on the project 
who accept the ultimate technical responsibility 
for the completion of the project will find the 
pressures unabated for a period of months. It 
is small wonder that persons, who have experi- 
enced this trauma a time or two, have fairly 
firm ideas as to the circumstances under which 
they will accept such a responsibility again. The 
completion of the project will lag unless this 
concentrated effort does occur, and the mana- 
gerial chain on the project cannot, at the be- 
ginning, predict which members of the team 
will punish themselves in this fashion. 

One of the important contributions being 
made by the more powerful programming sys- 
tems is the elimination of many of the frequent 
recording and cross-referencing errors so that 
the more important logical blunders do not re- 
main concealed for long in the debugging proc- 
ess. This alone materially reduces the length 
of the effort and permits more concentration 
on the solution of the important problems, 

PROGRAMMING SYSTEMS 

Because of the general confusion in connec- 
tion with the use of the words “programming 
system,'’ it may be helpful to distinguish (1) 
programming languages, (2) compilers, and 
(3) operating systems.^’ ® 

A programming language is the set of sym- 
bols and conventions which are designed for 
the convenience of the programmer to express 
his detailed solution to the logical, manipula- 
tive, and computational problems encountered. 


A compiler translates the expressions of the 
programming language into sequences of in- 
structions in the basic language of the com- 
puter hardware. 

An operating system augments the capabili- 
ties of the computer in such a fashion as to 
relieve the programmer of the requirement to 
provide in detail for input/output scheduling 
and assignment, memory allocation, etc., so 
that the programmer need not resolve these re- 
curring problems in numerous variations and 
can devote his efforts to implementing the data 
system design. 

The JOVIAL system includes all three of 
these components and has been implemented on 
several large-scale computers, but the versions 
of the language are not identical.'^ 

NELIAC has been widely used and consists 
of a language and a compiler. The data rou- 
tines are incorporated during the compilation. 
This technique has permitted the larger com- 
puters to compile programs for computers with 
memories too small to accommodate a com- 
piler.^ 

The CL-II programming system consists only 
of a compiler and an operating system. The 
compiler is described as “syntax-directed" and 
can accept any of the present algebraic lan- 
guages for which it is provided the necessary 
input tables. It has been demonstrated that 
careful preparation of these tables enables the 
compiler to generate as compact code as an 
experienced programmer. The sophisticated 
operating system of CL-II is based on the con- 
cept of the “extended machine" first expressed 
by Holt and Turanski.®’ 

A good programming system should 

(a) Encourage modular construction of 
the computer programs. 

(b) Provide for data descriptions which 
are independent of the program until 
compilation occurs, 

(c) Relieve the programmers of the neces- 
sity of constructing substantial debug- 
ging environments. 

(d) Eliminate the need for developing 
elaborate control programs. 
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There seems to be no way to avoid all the 
human errors which are made in the course of 
developing a large computer program, but a 
good programming system can relieve the pro- 
grammer of many of the housekeeping prob- 
lems so that his thoughts can be directed to 
the peculiarities of the data system being auto- 
mated. 

If there is to be reasonable progress in the 
production of a large computer data system, 
the programmer’s work should be reduced as 
much as possible, and the initial system should 
be constructed with the anticipation that 
changes and extensions of the system will be 
required on a continuing basis. 

SUMMARY 

In closing, it seems worthwhile to summarize 
the main points of this discussion : 

(a) The users of the data system must col- 
laborate with the technicians through- 
out the design, construction, and test- 
ing of the system. 

(b) A full design is desirable, but a path 
should be chosen through this design 
so that a useful sub-system is avail- 
able as soon as possible. 

(c) Anticipate changes and extensions to 
the system. 

(d) Do not stint on the documentation. 

(e) Exercising the system fully will prob- 
ably require building a simulated 
environment. 

(f) Putting extra manpower on a project 
at the wrong time to accelerate the 
effort will usually accomplish little 
except to heighten the disappointment. 

(g) Large-scale s{)ecial purpose computers 
still seem inadvisable because of 
changing requirements. 

(h) The programmer who takes his pro- 
fession seriously does not find it to 
be an easy life. 

(i) Programming systems may not have 
reached perfection, but they offer the 
programmer some relief from many 


of the troublesome housekeeping de- 
tails. 

(j) Standardizing on a single algebraic 
programming language is apparently 
no longer required. 

No claim is made that these observations 
form an exhaustive list of all matters to be 
considered by a group embarking upon the 
development of a large computer data system 
to assist in the evaluational and decision making 
processes of command and control. Each per- 
son who has been involved in such an effort will 
have a few items to add. Perhaps even this 
partial list will help some group to avoid a 
few of the mistakes others have made. 
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SOME COST CONTRIBUTORS TO LARGE-SCALE 

PROGRAMS 

Burt Nanus and Leonard Farr 
System Development Corporation 
Santa Monica, California 


INTRODUCTION 

In the early days of computer technology, 
only a small handful of highly competent, sci- 
entifically-oriented researchers were familiar 
with the programming arts. In those days, the 
management of programming effort was only 
slightly different from the management of other 
types of research activities. Each project was 
unique and its probability of success uncertain ; 
experience was severely limited ; tools and tech- 
niques were custom-built for each job. In the 
computer field today, despite many thousands 
of man years of experience in program develop- 
ment, we still tend to plan and to manage as if 
each program were a unique research project. 
This is partly due to the immaturity of the field, 
and partly because we have not yet fully recog- 
nized that the similarities between computer 
programs and their development are far more 
extensive than the differences between them. 
We have seen the development of many new 
tools to make the programmer more effective 
in his work ; we must now ask whether we can 
develop new tools to make the program man- 
ager more effective in planning and organizing 
his scarce resources of talented manpower and 
expensive computer time. 

One of the most important requirements for 
management planning is an accurate estimate 
of the resources required for the completion of 
the project. In programming management, the 
two principal resources to be estimated, sched- 


uled and controlled are man months and com- 
puter hours. Together, these resources may be 
considered the cost of producing the program. 
Historically, these costs have been very poorly 
estimated ; there are abundant examples of 
actual costs that exceeded estimated costs by 
100 per cent or more. 

Because better cost estimation is an impor- 
tant step toward more effective programming 
management, because the costs of programs 
may be a significant portion of the total costs 
of large management or command information 
systems and because the estimates have been 
little better than guesswork to date, the Ad- 
vanced Research Projects Agency of the Office 
of the Director of Defense Research & Engi- 
neering sponsored some research in this area 
at the System Development Corporation (SDC) . 
Early efforts were aimed at data collection and 
analysis of several large-scale command and 
control programming efforts representing a 
total of more than two million instructions and 
1500 man years of work.^* This paper is a 
summary of a subsequent elfort to (a) identify 
the common factors that influenced the cost of 
developing programs and (b) perform a pre- 
liminary analysis on some of the data.* These 
are viewed as necessary first steps toward 
the development of a more accurate cost esti- 

* The full reseach report upon which this paper is 
based will be published as TM-1447, Factors that Affect 
the Cost of Computer Programming, System Develop- 
ment Corporation. 
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mating procedure. Further progress depends 
upon more systematic data collection, and one 
of the purposes of this paper is to recommend 
the types of data that should be collected by 
programming managers for estimation pur- 
poses. 

PROBLEMS ENCOUNTERED IN 
DETERMINING COST FACTORS 

In the process of identifying and analyzing 
cost factors, one soon encounters a number of 
limitations in the programming field that may 
not exist in other, more mature disciplines. 
Some of these problems include the following: 

1. Lack of Agreement on Terminology — 

There are no universally acknowledged 
definitions of many of the terms used in 
the computer programming process. For 
example, the words ‘"debugging,” “param- 
eter test” and “program validation” 
may all describe the same process ; a “pro- 
grammer” in one organization may be 
called a “coder” in another and a “system 
analyst” in a third. Although we at- 
tempted to keep within the more limited 
context of command and control pro- 
grams in our research, we found, even in 
this narrower field of programming, a 
widespread lack of agreement on termi- 
nology. 

2. Poor Definition of Product Quality — ^Ap- 
parently there has been little success in 
defining those attributes that characterize 
the nature or the quality of a computer 
program. For example, one hears pro- 
grammers talking in terms of flexibility, 
economy of memory, and maintainability, 
but there seems to be no generally agreed 
upon criteria for comparing programs on 
the basis of these attributes. 

3. Poor Quality of Cost Data — Present cost 
collection methods seem to be designed 
primarily for accounting purposes and not 
for planning or control. For example, the 
collections of costs are usually grouped by 
organizational units rather than by prod- 
uct or function to be performed. 

4. Dynamic Nature of the Field — Although 
computer programming is maturing as 


a discipline, there is still a wide diversity 
of techniques and approaches being de- 
veloped and used. As a result, any study 
of cost factors must consider the history 
and likely future trends of programming 
technology. 

5. Nonquantitative Nature of Some Factors 
— Experience has shown that many of the 
factors that affect the cost of computer 
programs are qualitative in nature. In 
some cases, it is possible to predict at 
least the direction that cost will be af- 
fected by an increase in a given factor. 
For example, one would expect that the 
more experience one had with the particu- 
lar type of program or computer involved 
in a given task, the less it would cost to 
perform that task. In other cases, qualita- 
tive factors appear to have a nonmono- 
tonic effect as when an increase in a given 
factor (e.g., management planning) first 
decreases and th6n increases total cost. 
Of course, determination of the magni- 
tude of the effect on cost of qualitative 
factors is even more difficult than deter- 
mining the direction of the effect. 

Although these problems combine to make an 
analysis of computer programming cost factors 
somewhat difficult, a start must be made if pro- 
gram development efforts are to be more effec- 
tively planned and managed. 

COST FACTORS 

It is possible to identify hundreds of factors 
that contribute to the cost of computer pro- 
grams, if such a level of detail is desired. In 
this paper, we will present a list of approxi- 
mately 50 such factors, consolidated from a 
much larger list. Obviously, some classification 
scheme is necessary for discussion or analysis 
purposes. Factors might be grouped by work 
phase, such as program design or test ; by man- 
agement activity, such as planning or evalua- 
tion; by general categories such as resources, 
requirements, or environment ; by units of cost 
measurement, such as man months or dollars; 
or by the classic accounting method of direct 
and indirect costs. However, these schemes 
seemed to cause difficulties because of ambi- 
guities and overlap; as a result, a new classifi- 
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cation scheme was developed and is illustrated 
in Table I, in which factors are divided into 
these categories: The Job to be Done, The 
Resources that are Available, and The Nature 
of the Working Environment. 


Since a full discussion of all of the factors 
is not possible here, only the most important 
one or two in each category will be briefly pre- 
sented; the complete list is included as an ap- 
pendix. 


TABLE I— COST FACTOR CLASSIFICATION SCHEME 


Logical Grouping 

Category Name 

Category Definition 

THE JOB TO BE DONE 

1. Operational Require- 
ments and Design 

Includes cost factors associated with the 
operating characteristics of the system 
for which the program is being written. 

2. Program Design and 
Production 

Includes cost factors associated with 
both support and operational programs 
as determined by the constraints im- 
posed by personnel, hardware and oper- 
ational requirements. 

THE RESOURCES 

THAT ARE 

AVAILABLE 

3. Data Processing Equip- 
ment 

Includes cost factors associated with the 
data processing equipment required to 
produce and test a program, including 
all input, output and peripheral equip- 
ment. 

4. Programming Person- 
nel 

Includes cost factors resulting from the 
direct labor needed to completely de- 
velop a program. 

THE NATURE OF 

THE WORKING 
ENVIRONMENT 

5. Management Proce- 
dures 

Includes cost factors associated with the 
plans, policies, practices and review 
techniques used in the administration of 
all phases of program development. 

i 

6. Development Environ- 
ment 

Includes cost factors resulting from 
relationships of the programming staff 
with other organizations, such as cus- 
tomers and other contractors. 

7. Facilities, Services and 
Supplies 

Includes cost factors related to supplies, 
physical plant, indirect labor and over- 
head. 


THE JOB TO BE DONE 

1. Operational Requirements and Design 
The factors in this category tend to center 
around the question, ''How well are the opera- 
tional requirements of the information system 


known?'’ or, "How well is the problem de- 
fined?" Unfortunately, it is virtually impos- 
sible at the current state-of-the-art to evaluate 
the cost-contributing effect of many of the fac- 
tors which relate to this question. 
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The primary cost factor in this category 
would appear to be the extent of innovation in 
the system, in its components, and especially 
in the automatic data-processing function. The 
extent of similarity of the new system to older 
systems may be a clue to estimating how clearly 
or easily the requirements of the new system 
can be stated. If the requirements are well 
known to the program designers, the program- 
ming job is more straightforward and less 
costly. Somewhat related to this factor is the 
extent to which the programming organiza- 
tion will participate in the formulation of re- 
quirements; the less active its role in deter- 
mining requirements, the more likely misunder- 
standings will develop, resulting in costly er- 
rors, omissions and ambiguities. 

2. Program Design and Production 

In a large-scale program system, program 
design involves the determination of the broad 
logical subdivisions of the computer system, the 
design of an executive program to control the 
sequencing of programs, the design of the data 
base structure, the allocation of computer stor- 
age, and specifications for utility and support 
programs. As in the first category, these fac- 
tors center around the question, ''How clearly 
understood are the program requirements?'’ 
The factors also are concerned with the size and 
complexity of the job; the resources and tools 
available ; and the plans for documenting, veri- 
fying and testing the product. 

Undoubtedly, the most important cost factor 
in this category is the number of computer pro- 
gram instructions and the types of programs 
that must be produced. In current techniques 
for estimating cost, the size of the program is 
often used as an intermediate measure to esti- 
mate the number of men who will be assigned 
and the number of computer hours required. 
Despite this reliance on size as the key to cost 
estimation, it appears that little research has 
been done to develop systematic and reliable 
ways to predict the number of instructions. 

So far, the experience of many program man- 
agers is that the number of instructions is often 

* R. Bleier of SDC has reported on our attempt to 
relate total program length to the frequency of certain 
decision-class instructions, TM-1603. 
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Figure 1. Man Months Versus Program Size 
for Eleven Large-Scale Programs. 

grossly underestimated, except when very simi- 
lar programs can be used for comparison.**^' ^ 

The conversion of the estimate of number of 
instructions to programming man months for 
large-scale systems is frequently done by allow- 
ing one man month for each 200 instructions. 
To test this rule of thumb, empirical data on 
the number of machine language instructions 
in eleven large command and control systems 
were compared with the 200- instruction-per- 
man-month guideline (Figure 1). The pro- 
grams represent a variety of command and 
control systems using several different compu- 
ters and languages. The number of man months 
includes program design, testing and coding. 
A further analysis of the data revealed that the 
production rate for operational programs aver- 
aged 225 instructions per man month while the 
rate for utility programs was 311 instructions 
per man month. The explanation for the higher 
rate of utility program production was that the 

** For example, a report of the Controller’s Institute 
stated, . . almost all EDP groups have at one time 
or another seriously underestimated the number of 
steps required. . . . Every company we visited added 
a substantial safety factor varying from 20% for a 
company which claimed, due to experience, a reasonable 
accuracy in its estimating procedures, to 400% for a 
company which had found itself that far out on a 
previous estimate.” (Business Experience with Elec- 
ironic Computers, New York Controller’s Institute Re- 
search Foundation). ‘‘Measuring the Profitability of a 
Computer System.” See also J. D. W. James. 
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program developer is his own customer for the 
program system and, therefore, can write his 
own requirements with little external coordina- 
tion. For smaller programs not shown on the 
figure (i.e., less than 10,000 instructions), rates 
of as much as 400 to 1000 instructions per man 
month for individual programs were reported. 

The conversion of the estimate of number of 
instructions to computer hours is also subject 
to various rules of thumb. Figure 2 is based 
on experience with eight large programs. The 
three points falling below the line represent 
efforts in which a procedure-oriented language 
was used, indicating that such use may reduce 
the amount of machine time required for pro- 
gram development. As shown in the figure, one 
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computer hour is required for approximately 
every 53 instructions. 

Another hypothesis resulting from the same 
data is illustrated in Figure 3. This shows a 
near-linear relationship between the two re- 
sources of manpower and computer time. It is 
likely that better information for predicting 
the number of computer hours required for the 
production of programs of various sizes and 
complexities is available in most programming 
organizations although it is often buried in ac- 
counting data. 

A second important factor in this category 
is the extent of support program availability, 
reliability and documentation, including utility 
programs, debugging programs and library 
routines. Clearly, the more support programs 
that have to be produced ‘‘from scratch,’’ the 
more manpower and computer time will be re- 
quired for the total program development. 

One other factor that may have a considera- 
ble influence on cost is the number and types 
of documentation produced for various types 
of programs. The graph in Figure 4 is based 
upon an analysis of five large-scale program- 
ming efforts. It suggests that there is a linear 
relationship between the number of pages of 
documentation actually produced to satisfy con- 
tract requirements and the number of instruc- 
tions in the program. These data represent 


Figure 2. Computer Hours Used as a Function of 
Program Size for Eight Large-Scale Programs. 
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Figure 3. Man Months Versus Computer Hours for 
Seven Large-Scale Programs. 


Figure 4. Pages of Documentation Versus Program 
Length for Five Large-Scale Programs. 
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documents delivered to customers. There may 
be many times this number of pages actually 
written in a large programming effort. 

Some other guidelines for estimating the 
costs of documentation have been suggested. A 
drafting rate of 3 to 5 pages per day (750-1250 
words) is a good rule of thumb for various 
types of programming documentation. A tech- 
nical review rate of 20 pages per day seems 
average, but reviews that do not require exten- 
sive rewrites may run to 50 or 100 pages per 
day. Estimates on typing rates, illustrating 
rates and duplicating rates are usually fairly 
easy to obtain in most organizations. 

RESOURCES THAT ARE AVAILABLE 

3. Data Processing Equipment 

The development of computer hardware is 
proceeding at such a rapid rate that it is diffi- 
cult to make long-range estimates (beyond a 
few years) concerning its effect upon program- 
ming. Such improvements as faster add times, 
greatly increased memory capacities and speeds, 
multi-processors and new input/output devices 
may profoundly affect the accuracy of program- 
ming-cost estimation. 

With respect to equipment, a critical factor 
is the number of hours per day that the com- 
puter is available to the programming staff. A 
commonly held intuitive notion is that the more 
hours per day the computer is available to 
programmers, the lower the over-all cost of the 
programming effort. Among the considerations 
in determining the number of hours per day of 
availability are the number of shifts per day, 
the time required for preventive maintenance 
and the number of other computer users shar- 
ing the equipment. Of course, another strong 
influence is computer capability; for example, 
large memory capacities seem to make pro- 
gramming easier. Further considerations are 
the power of the order code, the speed of access 
to primary and secondary memories, operating 
time and speed of input/output gear. 

4. Programming Personnel 

The most important factor in this category 
relates to the experience of the programmers 


assigned to the job. There are three particu- 
larly important types of experience : 

(a) Experience with the particular com- 
puter — Clearly the more experience a 
man has with the particular machine for 
which the program system must be de- 
signed, the more apt he is to be familiar 
with its capabilities and therefore the 
less time it should take him to do the 
programming and testing. 

(b) Experience with the particular language 
— Programming languages differ in 
their suitability for various types of 
programming efforts. Familiarity with 
an appropriate special language, such as 
NELIAC or JOVIAL, certainly makes 
the programmer more efficient and re- 
quires a smaller number of man months. 

(c) Experience with the particular applica- 
tion — If the programmers have experi- 
ence with the particular type of system 
application and/or design being pro- 
grammed, then less time should be 
needed for the early analysis and defini- 
tion phases of the job. 

In addition to experience, cost is affected by 
the number of man months of programmer 
training required for the project. A possible 
hypothesis is that the cost of programmer 
training may take the form of a U-shaped 
curve. That is, for any given task there seems 
to be an optimum amount of training that the 
programmer should have. More training pre- 
sumably would not produce a commensurate 
return, while less would lead to errors and con- 
fusion in the programming. 

The number of people to assign to a given 
function or task also has an important impact 
on cost. There is probably an optimum number, 
for each type of programming effort, although 
research has not been conducted to determine 
what this is. In the field of research and de- 
velopment, which has some similarities with 
programming, the optimum number in a work 
group seems to be between 4 to 7.® 

There are also costs associated with obtain- 
ing personnel — either in hiring or in transfer- 
ring from other contracts and in relocating 
them, if necessary. This is a function of both 
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employee turnover and the size and type of the 
project. A study is currently being sponsored 
by the Navy at the University of Southern 
California to analyze the job of the computer 
programmer, develop criterion measures of per- 
formance and determine optimal personnel se- 
lection and classification procedures.^ 

NATURE OF THE WORKING 
ENVIRONMENT 

5. Management Procedures 

The design and institution of clear-cut man- 
agement procedures may seem costly at the out- 
set, but their true value must be determined by 
comparing the cost of the plan with the cost 
of not having the plan. Both these costs are 
extremely difficult to determine but experience 
in many programming efforts indicates that 
well planned projects enjoy higher productivity 
rates. 

The most important effect upon total cost in 
this category is the use, maintenance, and 
monitoring of a management plan that includes 
at least communications and decision-making 
procedures, mechanisms for handling changes 
in the program, delineations of responsibility 
and authority, and schedules for major mile- 
stones and products. The plans should also 
specify standards for flow charts and docu- 
mentation and quality .control procedures. There 
is probably some optimum per cent of time that 
should be spent in planning for program devel- 
opment. 

Another important cost factor is the number 
of computer runs permitted to each program- 
mer each day. One of the reasons for the cur- 
rent activity in the development of time-sharing 
procedures is the belief that a larger number 
of computer runs per day for each programmer 
will shorten lead times and decrease costs. An 
equally reasonable hypothesis is that a cost 
trade-off exists between desk checking and com- 
puter testing such that some optimum number 
of computer runs will minimize total cost, the 
optimum depending upon the relative cost of 
computer time and programmer time, 

6. Development Environment 

A particularly important problem is the num- 
ber of agencies with which the programming 


staff must coordinate. In addition to the user 
or customer, there may be separate agencies 
responsible for the contracting and for the 
other aspects of the system, such as hardware 
development. Problems of coordination and con- 
currence multiply as a function of the number 
of groups with which the programming staff 
must deal. Often, mutual education between 
these various staffs becomes necessary and the 
cost of briefings and meetings for such purposes 
may make the cost per instruction higher for 
large-scale program systems than for smaller 
ones. 

7. Facilities, Services, and Supplies 

In most accounting systems, normal overhead 
and miscellaneous supplies are covered by a 
percentage addition to the estimated direct 
labor and materials. However, unusual expendi- 
tures associated with large-scale programming 
efforts may not be adequately covered by the 
average overhead burden rate. For example, 
various types of technical and administrative 
support are needed to assist the programmer. 
Computer operators and EAM personnel save 
the programmer considerable time during test- 
ing. Effective and experienced management and 
administrative personnel assure that the work 
is efficiently organized and free the program- 
mer to concentrate on technical matters. Tech- 
nical editors help to ensure that documentation 
is adequate and understandable. There is some 
optimum mix of such support personnel that 
will ensure minimum cost. 

Another important factor in this category is 
travel and communication cost. Trips may be 
required for briefings and conferences with 
user organizations and associated develop- 
mental agencies ; for data gathering ; for train- 
ing and familiarization; and for concurrence 
on requirements or design. There is a strong 
tendency to underestimate these costs or to 
curtail them as an economic or political meas- 
ure; this often results in delays in getting data 
or concurring on programming details, and 
these delays may be very costly in terms of 
time and manpower. 

In some cases an important consideration is 
the cost of special facilities, e.g., simulation de- 
vices, special office equipment, or computer 
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room facilities. Special wiring and air condi- 
tioning, false flooring, space for storage and 
movements of parts and equipment, mainte- 
nance and test hardware, etc., all contribute to 
the cost. 

SUMMARY AND CONCLUSION 

In this paper we have listed some cost factors 
that represent a consolidation and skimming of 
a larger list of factors that contribute to the 
cost of computer programming eiforts. In many 
cases, the factors are very difficult to measure 
or quantify; further, their effect upon other 
factors and upon the total costs of the program- 
ming effort is often difficult to determine. For 
many of the factors, some data exist in current 
accounting records, but these data have not 
been collected, compiled and analyzed. 

The mere listing of cost factors in program- 
ming is only a first step, albeit an important 
one, toward the development of a more scien- 
tific and, hopefully, a more precise method of 
estimating the cost of programming efforts. 
Ultimately, one would hope to discover by 
analysis of data some predictors that would 
enable a more accurate estimate of the number 
of instructions, man months and machine hours. 
Among the cost data — both estimated and real 
— that we would hope programming managers 
would begin to accumulate for the various types 
of products and activities in their projects are 
the following : 

1. The number of machine instructions and 
the programming language used. Also, 
the percentage of the finished program 
consisting of library routines and sub- 
programs from previous programs. 

2. The number of man months of program- 
mer effort, including the first level of 
supervision, and the experience level of 
the programmers. 

3. The number of hours of machine time re- 
quired for testing and debugging pur- 
poses and the types of machines used. 
Also, the pattern of machine usage in 
terms of runs per day and hours per run. 

4. The number, types, and timing of impor- 
tant program changes and, in at least a 
qualitative sense, the effects of these 
changes on the final product. 


5. The types and number of pages of docu- 
mentation required, including a notation 
as to whether they are single or double 
spaced. 

In addition, it would be useful if a log could 
be kept by a project “historian"' describing cer- 
tain qualitative attributes such as those de- 
scribed earlier in this paper. This section should 
describe the data-processing functions of the 
program system and its relationship to other 
systems. It should also identify all interim and 
end products, such as types, listings and de- 
scriptive documents. 

A series of experiments to study those fac- 
tors that can be analyzed only in controlled en- 
vironments ideally should be conducted simul- 
taneously with the collection of new and exist- 
ing data on program costs. For example, well 
designed, statistical experiments would be use- 
ful for determining the effects of different types 
of programming languages upon total cost, the 
effects of greater or lesser machine availability 
upon costs, the optimum size of programming 
staff for different types of programs, and the 
best mix of programming talent (experienced 
versus inexperienced) for given types of jobs. 

Obviously, it may be quite some time before 
a valid predictive set of equations can be de- 
veloped for program cost estimation. Never- 
theless, it is to be hoped that each small step 
in this direction will represent a useful experi- 
ence in itself in terms of increased insight into 
the programming process. Certainly, research 
into techniques for improved programming 
management should be encouraged if the in- 
dustry is to keep pace with the increasing de- 
mands for its services. 

APPENDIX I LIST OF COMPUTER PRO- 
GRAMMING COST FACTORS 

Summarized below for the convenience of the 
reader is the complete list of cost factors dis- 
cussed in this paper. 

Operational Requirements and Design 

1. Extent of innovation in the system, its 
components, and especially the automatic 
data-processing function. 
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2. Extent to which the programming contrac- 
tor will participate in a determination of 
the information processing needs (i.e., the 
system and operations analysis, and the 
system and operational design) . 

3. Number, size, frequency, and timing of 
system design changes. 

4. Extent of command and control system de- 
centralization and number of interfaces. 

5. Number of other components and subsys- 
tems being developed concurrently as part 
of the command and control system (e.g., 
intelligence, sensor, etc.). 

Program Design and Pi^oduction 

1. Number of computer program instructions 
and the types of programs that must be 
produced. 

2. Number, types, and frequency of inputs 
and outputs to the computer (s) . 

3. Extent of innovation required in the pro- 
gram system; that is, the degree to which 
programs are similar in nature to those 
previously written. 
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and documentation for both customer and 
internal use. 

5. Extent of complexity of the data-process- 
ing functions. 

6. Degree to which the following program 
design characteristics are recognized and 
must be incorporated. 

(a) Maintainability — ^the ease with which 
program errors can be detected and 
corrected. 

(b) Changeability — the ease with which 
new functions can be incorporated in 
the program. 

(c) Usability — the ease with which per- 
sonnel other than designers can use 
the program. 

(d) Flexibility — the ease with which the 
program can be used for other pur- 
poses with only slight modification 
(e.g., SAGE programs for air traffic 
control) . 

(e) Generality — the ease with which a pro- 
gram can accept a wide range of in- 
puts. 


7. Extent of the constraints on program de- 
sign (e.g., real-time requiremnets). 

8. Number, size, frequency, and timing of 
program design changes. 

9. Extent to which data for data base is avail- 
able, or data collection is required. 

10. Number of entries (total size) for the data 
base, the number of different types of data 
needed for it, and the extent to which each 
can serve many programs or subprograms. 

11. Efficiency of the programming language 
and the compiler or assembler. 

12. Extent of the, completeness and clarity of 
the system test and acceptance test re- 
quirements. 

Data Processing Equipment 

1. Number of hours per day of computer 
availability. 

2. Extent of capability of the computer and 
its suitability for the job to be done. 

3. Extent to which the operation of the com- 
puter and peripheral equipment is reliable, 
well tested, and well documented. 

4. Number of equipment components being 
developed concurrently with the program. 

5. Number of different computers for which 
programs are being prepared. 

6. Number and types of displays used. 

7. Extent to which adequate EAM support 
will be available. 

8. Extent to which routine preventive and 
emergency maintenance will be available. 

Programming Personnel 

1. Types and quality of programmers. 

2. Number of man months of programmer 
training required. 

3. Number of programmers to be assigned to 
a given function or task. 

4. Policy of obtaining and phasing of person^ 
nel to staff a new contract. 

5. Rate of turnover. 

Management Procedures 

1. Extent of use, maintenance, and monitor- 
ing of effective management plans within 
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both the customer’s and program devel- 
oper’s organizations. 

2. Extent of formalized procedures to use the 
computer facility. 

3. Extent to which there is a well defined and 
controlled system change procedure. 

4. Extent of an error-reporting and -correct- 
ing procedure. 

5. Extent of contingency plans in the event 
the computer is overloaded or otherwise 
unavailable. 

6. Extent of quality control that is exercised 
during testing (e.g., reliability require- 
ments) . 

Development Environment 

1. Number of agencies the programmer con- 
tractor must deal with and their level of 
experience with system development. 

2. Average number of days and effort re- 
quired for concurrence. 

3. Travel requirements. 

4. Extent to which delivery dates for re- 
quired programming tools are reliable, and 
correspondingly, the amount of pressure 
caused by a tight schedule. 

5. Extent to which the computer is operated 
by another agency. 

Facilities, Services, and Supplies 

1. Number of computer operators and EAM 
personnel required. 

2. Number and experience of technical man- 
agement personnel, administrative person- 
nel, and technical editors. 

3. Cost of special simulation facilities, com- 
puter room facilities or special office equip- 
ment. 

4. Number of square feet of new office space 
or building required. 

5. Exceptional costs of graphic arts and re- 
production. 


6. Cost of punched cards, magnetic tape and 
other special supplies or equipment. 

7. Cost of special security requirements (e.g.. 
Top Secret vault). 
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INTRODUCTION 
Some Historical Notes 

During the nineteen-fifties the capabilities of 
electronic computers expanded so fast as to stay 
well ahead of the needs of the average computer 
user. Such was the case in both the analog com- 
puter and scientific digital computer fields. One 
effect of this situation was the formation of two 
schools of experts with opposite views on the 
choice of the “best” general purpose scientific 
computer. Differences of training, experience, 
and semantics led to a serious barrier to com- 
munications between these two groups. At best, 
the fashionable, middle-of-the-road position was 
to admit that each computing technique “had its 
place,” y/hich did little to break down the bar- 
rier. Only with the appearance of a computa- 
tional task that could not be accomplished sat- 
isfactorily by either type of general purpose 
computer — only then was the barrier cracked 
and a small opening made. 

In the aircraft and the budding aerospace 
industries the analog computer had long been 
the primary, if not the only, means employed 
for simulation of new or proposed engineering 
designs. In fact the analog computer grew up 
as the aircraft simulation tool. A mathematical 
model of each new design was formulated and 
an electronic analog model formed on the com- 
puter. Performance data obtained from wind 
tunnel tests were added to complete the model, 
and large numbers of experimental “flights were 


flown” as the effects upon the dynamic behavior 
of the model, with changes in design parame- 
ters, were examined. The characteristics and 
performance of the simulation model often pro- 
vided input data for a digital computer program 
that would evaluate, optimize, and calculate 
final design specifications. The digital computer 


was also useful as a simulatioii tool in calculat- 
ing orbits and ballistic trajectories and in de- 
signing guidance and control systems for such 
aerospace flights. 


While conventional computer simulation tech- 
niques were quite adequate for most aerospace 
design problems, the exceptional costs and haz- 
ards involved in missile and manned vehicle 
missions required computer simulation of com- 
plete missions. Some full mission simulations 
required inclusion of actual control mechanisms 
and of a manned cockpit, thereby dictating 
simulation on a real-time scale. It was apparent 
that such total system simulations exceeded the 
capabilities of any single type of computer. Only 
by the combined application of analog and digi- 
tal computers could adequate simulation be 
accomplished. It was hoped that the special 
features of the two computer types could be 
combined to form a hybrid simulator of su- 
perior performance. 


Thus the earliest attempts to combine the 
computation of analog and digital computers 
took place in about 1958 at the Convair Astro- 
nautics plant in San Diego and at the Space 
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Technology Laboratories in Los Angles. **-2''^* 
In both cases the job at hand was the complete 
mission simulation of the trajectory of a long 
range missile. The speed of the analog com- 
puter was a necessary element in the study to 
permit a “real time” simulation of the rapid 
motion of the vehicle and of control surfaces. 
However, the dynamic range required of the 
simulation was in excess of that of the best 
analog computers. That is, the ratio of the total 
range of the trajectory to the required terminal 
phase resolution (a dynamic range of 10® to 
10'^) was greater than 10^, the upper limit of 
analog computer dynamic range for small pro- 
grams. Hence the digital computer was used to 
calcula^^ those variables for which such dy- 
namic range was necessary. The most im- 
portant of these were the navigational coordi- 
nates — the digital computer performed the open 
integration of velocities to determine the 
vehicle’s position, plus the dynamic pressure, a 
function that is very sensitive to altitude and 
velocity. 

It is fortunate that in such long range aero- 
space trajectory simulations the variables with 
wide dynamic range requiring precise calcula- 
tion are not, at the same time, rapidly changing. 
Moreover the “high speed” variables do not re- 
quire precise calculation. The early combined 
computer systems employed the largest and 
fastest digital computers available at the time — 
Univac 1103A and IBM 704 — together with 300 
to 400 amplifiers of general purpose PACE 
analog equipment. In both cases even these fast 
digital computers were only just fast enough to 
perform the required repetitive calculations for 
the slowly changing variables of the simulation 
in real time. 

Since the installations of the first combined 
computers at least a dozen computer labora- 
tories have employed general purpose analog 
and digital computers together to solve simula- 
tion problems, and a number of attempts have 
been made to devise spacial purpose systems 
of analog and digital devices. Among the latter 
are the CADDA of the National Bureau of 
Standards and the “pulsed analog computer” of 
the MIT Electronic Systems Laboratory.^®'ii-i^> 
is.36,42 Hybrid computers of a unique type are 
the combinations of a general purpose digital 
computer and a digital differential analyzer 


(dda), illustrated by the Bendix G15 with the 
DA-1 attachment and the Packard Bell PB250 
with the Trice dda.*'^ The former system con- 
sists of a small, slow computer with an even 
smaller serial dda ($50,000 and $10,000 re- 
spectively) . In contrast to this the Packard Bell 
system combines a small, medium speed com- 
puter ($40,000) with a large serial-parallel dda 
($500,000). Among the systems of general pur- 
pose computers, generally large analog com- 
puters have been combined with both large 
(IBM 7090) and small digital computers (PB 
250, LGP 30).2-3-’-17-19,21 

A brief analysis of the applications to which 
existing installations of combined systems have 
been applied leads to these generalizations. For 
the most part the analog computers in these sys- 
tems have been employed in a normal manner 
to simulate the dynamic behavior of physical 
systems by solving sets of non-linear, ordinary 
differential equations, while the digital com- 
puter has performed one or more of the follow- 
ing three functions : complex control logic, stor- 
age of arbitrary functions or sampled analog 
functions, and high precision arithmetic pri- 
marily for numerical integration. Examples 
of the applications are : 

a. Analog computer plus digital control logic. 

A system that in itself contains discrete con- 
trol functions of continuous dynamic variables 
is appropriately simulated by a hybrid com- 
puter. The kinetics of a chemical process are 
simulated by continuous analog means while its 
digital control system is represented by a digital 
program.29-«i Similarly an aerospace vehicle 
with an on-board digital computer, control sys- 
tem, or autopilot is simulated by hybrid tech- 
niques. 

b. Analog computer plus digital memory. 

A very common difficulty in the simulation of 
a chemical or nuclear reactor is providing an 
adequate representation of the transport of 
fluid in pipes from one point to another — from 
reactor to heat exchanger. The simulation of 
this transport delay of a dynamic variable, such 
as the time variation of the fluid temperature, 
is very nicely accomplished by the use of a digi- 
tal computer for storage of the temperature 
function for a fixed, or variable, length of time. 
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Digital computer memory has also been used 
effectively to store multivariable arbitrary func- 
tions — an operation which is seriously limited 
in the analog computer. 

c. Analog computer plus digital arithmetic. 

This type of application is the “classic” one 
where the digital computer is used to perform 
precise, numerical integration of space vehicle 
velocities to keep track of the exact position of 
the vehicle over a very long range flight. 

It should be noted that a significant difference 
is apparent in the applications of computer sys- 
tems employing a very small digital computer 
and those with large, very fast computers. In 
general the small machines are limited to execu- 
tion of control logic programs, one or two chan- 
nels of transport delay simulation, or limited 
function generation programs. Since numerical 
integration and complex function generation by 
digital programs require considerable time for 
each calculation, for each discrete step in time, 
only the fastest digital machines can be used 
effectively for these tasks. 

SOME COMMENTS ON THE EVOLUTION 

riT7» XJVD'DT'n CTHTTTT A 'T’TrkXT 

J.X X kJxxTX ±j.n. X xv/x^ 

The term “computer simulation” appears in 
so many contexts it is important to emphasize 
that its use in these pages is limited to the 
modelling of complex physical systems for the 
study of their dynamic behavior. These systems 
are represented by sets of differential equations, 
algebraic equations, and logic equations. As in 
most simulation studies the objectives of hybrid 
simulation may be experimental design, pre- 
diction and control, and design evaluation, veri- 
fication, or optimization. It is not expected that 
the important applications of hybrid simulation 
will include: data processing system simula- 
tion ; information handling simulation ; business 
system simulations; siiaulation of television 
coding, character reading machines, communi- 
cations coding systems, etc. . . . These are all 
digital computer simulation applications. Simi- 
larly hybrid computers are not warranted for 
the simulation of circuits, devices, and systems 
for which the analog computer is quite adequate. 
It is in the simulation of total systems that 
bring together components, some of which are 
suited for digital and some for analog simula- 
tion, that the newer hybrid techniques are re- 


quired. There probably are few, if any, simple 
hybrid computer applications. 

If hybrid computation can be said to be a 
field of endeavor it must be considered to be in 
the formative stage. Developments to date have 
led to equipment configurations and program- 
ming techniques that were dictated by specific 
problems and limited objectives. The growth 
rate of the field will be determined by the ex- 
tent to which a broader view is taken of hybrid 
computer programming techniques and appli- 
cations. The greatest advances in computers 
have been made when the experiences of users, 
programmers, have been brought to bear on 
design of equipment. For the most part, hybrid 
computers of today consist of general purpose 
analog and digital machines, which are not de- 
signed for hybrid operation, tied together by 
“linkage equipment” designed only to solve the 
communication problem. This has been a nec- 
essary first step. Newer hybrid systems will be 
designed not just for communications but for 
efiicient solution of hybrid problems and for 
convenient programming. The purpose of the 
following discussions is to bring attention to 
the essentials of hybrid computation and their 
relationship to problem solutions. Out of this 
should come some indication of how present day 
computers can be most effectively employed. 
The next generation of hybrid computers will 
undoubtedly achieve a greater degree of inte- 
gration of parallel computing elements with the 
sequential stored program principle. Eventu- 
ally patch panel programming of parallel ele- 
ments will be replaced by an automatic system, 
thus affording a fully automatic method for 
computer set-up and check-out. Even today a 
secondary activity of the digital part of the 
hybrid machine is the partial automation of set- 
up and check-out of the analog computer. This 
feature becomes increasingly important as the 
computer system grows in size and the pro- 
grams grow in complexity, for the attention of 
the programmer and problem analyst needs to 
be directed to the simulation itself rather than 
the simulator. 

THE ELEMENTS OF HYBRID COMPUTERS 
Digital Computers 

Many conflicting factors have influenced the 
choice of digital computers used in hybrid sys- 
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terns. Computer speed and economics have 
probably been dominant. Since there are so 
many computers on the market today that have 
sufficient speed and that span the complete 
range of prices, it is more instructive to ex- 
amine the features that are essential for hybrid 
computation. 

Speed. The speed of execution of arithmetic 
operations is most important, and this is a func- 
tion of memory access time and multiplier 
speed. The access speed of drum and delay line 
memories is too slow. Magnetic core access 
times of 2 to 5 microseconds are currently popu- 
lar. This means the time for addition of two 
numbers is 4 to 10 microseconds. Multiplication 
and division take longer — ^times of 15 to 40 
microseconds are generally available and quite 
satisfactory. Overall program spee^ can be in- 
creased by the use of index registers — ^three 
registers is desirable ; more are useful. Special 
instructions for subroutine entry, for executing 
commands out of sequence, and for testing and 
skipping can help increase computing speeds. 

Word Structure. The basic requirement is for 
a fixed point, binary word of at least 24 bits. 
Since round-off errors affect the last several 
bits a smaller word size would result in a 
dynamic range limitation of less than 10«. A 
longer word may be useful in a few applications 
where fixed point scaling may be difficult. Float- 
ing point computations may make things easier 
for the programmer but should not be depended 
upon at the expense of computational speed. 
It may be noted that the equivalent of fixed 
point scaling is a necessary part of the analog 
program, and hence fioating point operations 
may not prove as advantageous as for some all- 
digital programs. Decimal format and char- 
acter oriented machines do not offer any advan- 
tages for hybrid computation, and usually they 
are slower than equivalent sized binary com- 
puters. 

Input/Output. High data rates in and out of 
core memory and any feature that minimizes 
loss of computing time for input/output opera- 
tions are highly desirable. In addition a fast, 
flexible means for communicating control sig- 
nals to and from the analog section of the 
hybrid system is necessary. Three kinds of con- 
trol signals are usually provided : interrupt and 
sense lines, and output control signals. It is by 


means of these controls that the sequential op- 
erations of the digital machine are made com- 
patible with the parallel, simultaneous opera- 
tions of the analog machine. Since communica- 
tions must be made with many points in the 
analog computer, a number of these control sig- 
nals are needed. Interrupt signals, from out- 
side the computer, stop the current sequence of 
calculations and force transfer to another 
sequence. Sense lines simply indicate to the 
digital program the current state of devices 
outside the computer. They may be sensed by 
specific programmed instruction. Other pro- 
grammed instructions will send control signals 
outside the computer on the output control 
lines. 

Memory. As noted above the digital com- 
puter main memory should be a high-speed 
magnetic core. Since most hybrid applications 
do not require a large memory for either pro- 
gram instructions or data, four, eight or twelve 
thousands words of core memory should suffice. 
Larger memories may be desired for special 
digital programs and larger hybrid problems 
when more experience has been gained in this 
field, thus expandability of a memory to 16K 
words is a good feature. Newer computers are 
being introduced with small, very high speed 
“scratch pad” memories. Such memories may 
have cycle times less than a microsecond and 
are used to store intermediate arithmetic re- 
sults. This feature increases the overall compu- 
tation speed of the computer. 

The normal manner of operating an analog 
computer involves a fair amount of non-com- 
puting time when the computer remains in the 
Hold or Reset mode. These intervals may range 
from seconds to minutes while adjustments are 
made, pots are set, or recorders changed, or 
while the programmer analyzes results. It is 
not possible for the analog computer to operate 
on other programs at these times, however, 
with a hybrid system, where such waiting 
periods are likely to occur also, it is reasonable 
to consider having the digital computer work 
on a different program during the intervals, 
whatever their length. Appropriate “interrupt” 
and “memory lock-out” features are possible to 
permit time sharing of the digital machine 
without affecting the hybrid program and with- 
out the danger of one program interfering with 
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the other. The secondary program (a strictly 
digital problem) is simply stored in a “pro- 
tected” part of the core memory and utilizes all 
the bits of time not required by the hybrid 
program. 

Peripheral Equipment. In many digital com- 
puter installations the investment in peripheral 
equipment rivals that in the central computer. 
Current hybrid computer applications require 
only a minimum of digital peripheral equip- 
ment. The graphic output equipment associated 
with the analog computer is sufficient for com- 
putational results. Punched paper tape reader 
and punch and typewriter may be all that is 
required for programming. Larger systems in 
the future will employ punched cards and mag- 
netic tape, primarily for rapid change-over of 
problem and automatic check-out. Large off- 
line data storage does not api)ear necessary for 
most applications. 

In summary the digital computer must be 
characterized as a sequential machine. For 
effective use within a hybrid system the ma- 
chine (a) must have sufficiently high internal 
speed for it to appear as though a number of 
calculations were taking place simultaneously; 
(b) must be organized for maximum speed in 
executing mathematical calculations; and (c) 
must have efficient means for input and output 
of data during calculation (Figure 1). 
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Figure 1. The digital computer must be characterized 
as a sequential machine. For effective use within a 
hybrid system the machine must have sufficiently high 
internal speed for it to appear as though a number of 
calculations were taking place simultaneously. It must 
be organized for maximum speed in executing mathe- 
matical calculations; and it must have efficient means 
for input and output of data during calculation. 


Analog Computers 

In contrast to the above the analog computer 
is a parallel machine, with many computing 
components and I/O devices operating in con- 
cert. There are few, if any, features of the mod- 
em analog computer that are not appropriate 
to a hybrid system. However, only the largest 
analog machines have been used for general 
purpose hybrid simulation. The common meas- 
ure of a large computer is that it has 100 to 200 
operational amplifiers. Since two or more com- 
puters may be “slaved” together, larger systems 
are possible when required. 

Analog computer features that are important 
for hybrid systems can be simply listed as : 

— Integrators with multiple time scales 

— Amplifiers for tracking and storing volt- 
ages 

— ^Very fast control of the modes of indi- 
vidual amplifiers 

— Automatic, remote control of the setting of 
potentiometers 

— Fast, accurate multipliers and trigono- 
metric resolvers 

— High speed comparators with logic signal 
outputs 

— Electronic switches (logic signal gating of 
analog signals) 

In the early days logic equations or switching 
functions were programmed with relays and 
stepping switches, which were connected to the 
patch board by various means. Present day 
technology employs electronic switching of inte- 
grator modes and voltage signals at high 
speeds, and the delays inherent in relay devices 
can no longer be tolerated for logic operations. 
The logic building blocks common to the digital 
computer designer (flip-fiops, gates, inverters, 
monostable multivibrators, shift registers, and 
counters) are ideally suited to these operations. 
Thus with electronic switches replacing relay 
contacts, logic modules have become an integral 
part of all new, large, analog systems. These 
modules are programmed like the other analog 
components by interconnections at a patch 
panel. Many signals occur simultaneously but 
they are logic signals — ^two values. Zero and 
One, that change as functions of time. Input 
signals to logic programs come from compara- 
tors, manual switches, and external control sig- 
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nals. Logic program outputs go to integrator 
mode controls, storage amplifier controls, elec- 
tronic switches (DA switches) to gate analog 
signals. As will be shown later, it is essential 
for a hybrid system to have a very significant 
complement of digital logic components. A few 
gates and flip-flops are not sufficient. The 
potentialities for use of logic components in an 
analog computer for hybrid operation are so 
great that the EAI HYDAC Digital Operations 
System is an entire computer console with its 
own patching system used entirely for the pro- 
gramming of digital components for parallel 
computation. This console is really a complete 
logic computer. It is used together with a con- 
ventional analog computer to form what is truly 
an all parallel hybrid computer, *• 2*- 24 . 29 . 40. 41 

In summary the modem analog computer 
must be characterized as a parallel machine. It 
is not solely a computer for continuous vari- 
ables. It is a parallel assemblage of building 
blocks: integrators, multipliers, etc., for con- 
tinuous variables; and flip-flops, gates, count- 
ers, etc., and “digital” circuits for discrete vari- 
ables. It is organized for convenient represen- 
tation of an “analogous” physical system by 
means of a computer model constructed of these 
building block (Figure 2). 



Figure 2. The modem analog computer must be char- 
acterized as a parallel machine. Not solely a computer 
for continuous variables, it is a parallel assemblage of 
building blocks: integrators, multipliers, etc, for con- 
tinuous varialiles; and ilip-fiops, gates, counters, etc., 
and “digital” circuits for discrete variables. It is 
organized for convenient simulation of an “analogous” 
physical system by a computer model that is constructed 
of these building blocks. 


Conversion Devices 

In providing data communication between a 
sequential computer and a parallel computer 
three kinds of devices are commonly used : the 
multiplexer, the analog-to-digital converter 
(ADC), and the digital-to-analog converter 
(DAC). In addition, all early systems have 
employed a timing and control unit which per- 
forms a relatively fixed set of operations, with 
manual switches to select options such as 
sampling frequency, and number of channels. 
Such “linkage systems” thus consisted of a 
timer unit plus a group of linkage building 
blocks prewired to perform a specific task. With 
the integration of digital logic components into 
the parallel computer, however, greater pro- 
gramming flexibility is possible by use of these 
logic units for timing control of the data con- 
versions. Furthermore, the converters and 
multiplexer can act very naturally as additional 
building Blocks in the parallel computer. Thus 
it is likely that future hybrid systems will 
simply incorporate the “linkage system” within 
the parallel computer. 

Usually several or many analog signals in a 
hybrid program will be sampled, converted and 
transmitted periodically to the sequential digi- 
tal program. The numbers, of the several 
sequences of numbers to be entered into the core 
memory, can be accepted only one at a time. 
Since this is so, the conversions from voltage 
to number form can be performed one at a 
time— first from one analog variable and then 
from another. The multiplexer is used to select 
one from many analog signals, to step through 
a sequence of signals, and thus to furnish volt- 
age input signals to the ADC. 

The output of commonly used ADC’s is a 
binary number of 10 to 14 bits. A 13 bit binary 
output probably is the best compromise; it 
represents a resolution of one part in eight 
thousand, and resolution of analog voltage sig- 
nals is at best one part in ten thousand. Conver- 
sion times range from 50 to 300 microseconds. 
A typical time of 100 microseconds would allow 
the converter to be shared by 16 analog signals 
each with a frequency spectrum extending to 20 
or 30 cycles per second. This will be explained 
later. 

DAC units should have the same binary word 
size as the ADC, except for special low accuracy 
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uses. Conversion times are not determined by 
the converter but rather by the bandwidth of 
the analog amplifier following the converter. 
Data from the sequential computer appear only 
one word at a time, and some means for retain- 
ing the latest value, of each sequence of num- 
bers, for each output function is needed. The 
sequence of numbers coming from the computer 
may first be converted to voltage values by a 
single DAC, and then distributed to storage 
amplifiers for each channel. It is more cus- 
tomary, however, to hold each of the latest 
words for each channel in a digital register 
which is an integral part of the DAC assigned 
to each channel ( Figure 3 ) . 

Special Forms 

As a passing thought it may be noted that 
while the primary emphasis here is being 
placed on the distinction between parallel and 
sequential operation, the term “hybrid,” his- 
torically, has been used to imply the combina- 
tion of continuous and discrete calculations, and 
that therefore consideration might be given to 
two special kinds of hybrid computers: 

The Parallel Hybrid Computer, which is a 
proper term for the EAI HYDAC 2000 machine. 
This system combines an analog computer with 
a general purpose system of programmable 
logic building blocks, multiplexer, ADC, DAC’s, 



Figure 3. ADC's and DAC's are format converters, 
changing voltage to numbers and vice versa. As com- 
ponents of the parallel machine the converters together 
with logic components must act to ‘‘match impedances,” 
i.e., resolve the incompatibility between the parallel 
and sequential programs. 


digital memory units for storage of sampled 
analog functions, and several digital numerical 
adders and subtracters. The application of this 
system encompasses an intermediate range of 
hybrid problems, such as: 

a. Transport delay simulation®^ 

b. Single and multi-variable function genera- 
tion 

c. Logic control systems®* 

d. Automation of the analog computer for 
parameter searches and optimization 
studies^* 

e. Simulation of numerical and sampled data 
control systems*-* 

The Sequential Hybrid Computer, which is 
exemplified bytthe experimental “pulsed analog 
computer” t^niques developed at M.I.T. for 
use in an aircraft flight trainer.** This system 
employs a f:sequential digital computer which 
controls a small number of analog functional 
components — one multiplier, one reciprocal 
generator, one integrator, and several adding 
units. These units are interconnected and re- 
ceive inputs by digital program control. They 
form, in effect, “analog subroutines” for the 
sequential computer. 

THE SEQUENTIAL/PARALLEL HYBRID 
COMPUTER 

The term “hybrid” is most appropriately 
used to indicate the combined use of sequential 
and parallel computing techniques, first because 
the future growth in hybrid simulation will be 
predominantly in this d irection. Second, and 
more important, is that from the standpoint of 
the programmer who must bring the two t 5 T)es 
of computers together to find a useful solution 
to a problem, the only really significant dis- 
parity lies between sequential and parallel op- 
erations. The difference between continuous 
voltage and discrete number is simply one of 
format. It would make little difference if the 
analog signals were frequency modulated, pulse 
code modulated, hydraulic, or pneumatic — 
appropriate format converters could be found. 
The feature of hybrid computation that is of 
importance is : in part of the machine many op- 
erations are taking place simultaneously, and 
many time-varying problem variables exist in 
parallel; while elsewhere a number of opera- 
tions take place, one at a time in a repetitive 
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manner, so as to effect the generation of several 
problem variables, as if they occurred simul- 
taneously. Furthermore, the parallel computa- 
tion is tied to a real-time base : the very passage 
of time itself accounts for the changing of the 
basic independent computer variable. The 
sequential program is asynchronous — ^not con- 
trolled by a clock. Operations are executed in 
sequence at whatever rate is possible, and for 
any reference to be made to the actual elapsed 
time, external communication is necessary. This 
basic incompatibility requires that the interface 
between the two types of operation embody 
more than the simple format conversions i)er- 
formed by the ADC and DAC’s. It is necessary 
for data and control information in the parallel 
machine to be available to the sequential ma- 
chine and conversely that the latter be able to 
send data and control signals to many points in 
the former. Coincidence or simultaneity of 
events communicated to and controlled from the 
sequential program are particularly difficult to 
handle. The logic and data control of the inter- 
face equipment must resolve these differences 
in timing and operation. What might be termed 
an “impedance matching” device is needed be- 
tween the parallel and sequential program in 
order to make most efficient use of both ma- 
chines. The exact manner in which this is done 
will vary from problem to problem (Figure 4). 

A Simple Example 

An example will illustrate some basic con- 
siderations in defining a general purpose hybrid 
system. First, the operation of the simplest of 
linkage systems: an ADC and multiplexer, a 



Figure 4. A hybrid computer is a compatible system of 
parallel computing components, both digital and analog, 
and a stored progrram sequential machine. The hybrid 
computer programmer must constantly be aware of the 
relative timing of events in the parallel and sequential 
parts of the program. “Matching impedances” between 
these parts is accomplished by programming of the 
interface components to suit the simulation. 


number of DAC’s, and an “interrupt clock.” 
The flexibility of the stored program digital 
computer is relied on for control of these units. 
Assume 10 analog signals are to be converted 
one at a time. These words are placed in mem- 
ory (average program time: 40 /tsec. per word 
and then about 7.5 or 8.0 milliseconds of sequen- 
tial, digital calculations takes place, followed by 
output from memory of ten words (20 j^sec. per 
word) to ten DAC’s. The entire cycle requires 
7500 -f 10 X 40 (input) -f 10 X 20 (output) = 
8100 microseconds of digital program time. If 
it is assumed the conversion of the data (A to 
D) requires 150 microseconds per word then 1.5 
additional milliseconds, or 9 ms are needed if 
everything proceeds sequentially. Assume 
further that because of the frequencies, or the 
analog signals, it is necessary to sample at least 
some, and therefore all, of the channels at 100 
samples per second. A “reahtime interrupt 
clock” is set at 100 cycles per second. This 
timer unit is an adjustable oscillator that sends 
an interrupt pulse to the digital computer. The 
latter then activates the ADC, waits 150 fxsec 
for completion of a “Ready” signal, steps the 
multiplexer to the next channel, stores the 
converted word in memory, and then repeats 
this cycle ten times. With the tenth step the 
multiplexer resets to the first channel. The 
program then proceeds with the 7.5 millisec- 
onds of calculation, outputs ten words, one at a 
time, to ten DAC’s, and then waits for the next 



Figure 5. An early hybrid system configuration. The 
sequential program controls the timing of the conver- 
sion cycle. The cycle is initiated by the “interrupt 
clock.” For a tyical problem the clock might be set 
for 100 cps; and 7 to 8 milliseconds per cycle would be 
available for calculation. 
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INTERRUPT ADC COMPLETION 



Figure 6. The sequential program flow diagram, for 
the example hybrid system of Figure 5, shows the steps 
required for control of the converters. 



Figure 7. Typical times are shown for the steps in the 
conversion cycle of the example system of Figure 5. All 
ten channels are converted and stored in memory as 
fast as possible before proceeding with calculations — 
at the expense of intervals of “waiting time.” 


interrupt pulse. (See Figures 5, 6, and 7.) 
Manual controls are provided for selecting the 
interrupt clock frequency and the number of 
channels in the multiplexer stepping cycle. 

This is certainly a simple system and it ap- 
pears to satisfy the basic requirements for com- 
munications. Some of the shortcomings of the 
system are apparent: sampling and outputting 
of each channel do not take place simulta- 
neously, 15% of the sequential program time is 
“waiting” time, and 3 to 5% is used to select 


and control devices external to the sequential 
computer. In other problems these percentages 
may be higher. The other weaknesses in this 
system lie in the fact that it was not designed 
to be a general purpose system. It is restricted 
in application to a class of problems for which 
the periodic “input/calculate/output” cycle is 
useful. 

System Improvements or Variations 

By programming the parallel digital com- 
ponents of the parallel computer to perform 
timing and control functions for the system the 
following changes to the above system are 
suggested : 

a. Simultaneous sampling. If the sequential 
program operates on two or more of the input 
numbers together to calculate an output, then 
errors may occur since the input numbers were 
sampled at different times and correspond to 
different values of the independent variable. A 
similar effect may occur at the output since the 
numbers in a group of ten appear at the ten 
DAC’s at different times. It is certainly pos- 
sible by numerical means to compensate for the 
errors, at the penalty of additional program 
time.**’ 2* The common solution is to add mem- 
ory to each of the ten input and ten output 
channels. Ten Track/Store amplifiers are added 
before the multiplexer and a control signal 
causes them all to sample, by storing the volt- 
ages, simultaneously. At the output, 13 bit 
registers are added in front of the DAC’s. 
When all ten registers have been loaded, a 
transfer pulse causes all DAC values to be 
updated at the same time. 

b. External timing of ADC and multiplexer. 
Sequential program time can be saved by per- 
mitting the control of the ADC, multiplexer, 
track store cycle to be controlled externally. A 
simple clock, counter, flip-flop, and group of 
logic gates will permit the input conversion 
cycle to run at its own rate — interrupting the 
sequential program only at the completion of a 
conversion. Thus the conversion time can over- 
lap the calculation time, eliminating the wait- 
ing time. Upon interrupt only 10 to 20 ^sec. 
may be required per sample ; many control steps 
are eliminated. Similarly on output, the ad- 
dressing and selection of output channels can 
be done by simple circuits rather than using 
sequential program time (Figure 8). 
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Figure 8. Several improvements to the interface system 
of the example of Figure 5 include: simultaneous 
sampling, external ADC and multiplexer timing hy a 
parallel logic program, use of sense lines to reduce 
conversion channels, detection of random events, 
multiple sampling frequencies, and/or asjmchronous 
sampling. 

c. Real-time clock to establish sampling fre- 
quency. If the sequential calculation involves 
numerical integration over a long term, the 
accuracy of the sampling interval is just as im- 
portant as the round-off and truncation error 
in the numerical calculation. Although numeri- 
cal means may be resorted to for very accurate 
integration, in a hybrid program the calcula- 
tions still need to be referred to a real-time 
base. This is done by using an accurately cali- 
brated source for setting the sampling interval, 
or frequency. A good high-frequency crystal 
stabilized oscillator is an important part of the 
parallel digital subsystem. Sampling frequen- 
cies lower than the oscillator frequency are 
selected by use of preset counters. 

d. Use of sense Vines to reduce number of 
conversions. In the simple example problem 
only whole number data are transmitted to the 
sequential program. Thus, if the relative mag- 
nitude (greater or less than) of two analog 
signals is needed in the digital calculation, the 
two numbers must be converted, stored, and 
then compared. This can be accomplished more 
simply by use of an analog comparator the out- 
put of which is sent to the digital computer by 
a sense line — ^saving time and equipment. The 
state of any parallel logic component may be 
monitored conveniently by sense lines, l^ese 
are tested in one memory cycle (2-5 /nsec.). If 


many such communications are needed the sav- 
ings will be significant. Sense lines should also 
be added to allow the sequential program to 
determine the modes of the analog computer, 
the relative sizes and signs of error quantities, 
and the states of recording devices. 

e. Detection of random events. With fixed, 
periodic sampling the sequential program can- 
not tell exactly when events take place in the 
parallel machine. With comparators and parallel 
logic, complex functions of analog variables can 
be monitored. For example, it might be required 
to determine when the overshoot in Xi exceeds 
X 2 after the third cycle, but only when Xs is 
negative and X 4 is less than X 5 . After determi- 
nation the sequential program can be inter- 
rupted to perform specific conversions and cal- 
culations — asynchronously with respect to the 
primary conversion cycle. In this manner the 
parallel logic avoids the delays in the sequential 
program and uses the latter only when required. 
The parallel logic program analyzes the data, 
interrupts the sequential program, and sets up 
the proper channels for conversion in and out 
of the digital computer ( Figure 9). 

f. Multiple sampling frequencies. In the ex- 
ample problem all channels are sampled at a 
frequency determined by the highest frequency 
present in any one channel. It may often be the 
case that there are two or more groups of vari- 
ables with different ranges of variable frequen- 
cies. It may then be appropriate to sample each 
group at different frequencies. Another ap- 



Figure 9. A parallel logic program is used to detect 
random events in the parallel computer. The “desired 
control signal” interrupts the sequential program when 
the overshoot in Xi exceeds xa after the third oscilla- 
tion; but only when Xs is negative and x, is less than 
Xs. Xi through Xs are analog voltages. 
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proach using different sampling rates is to use 
several eight channel multiplexers in cascade so 
that the output of two of them feed two chan- 
nels of a third which feeds the ADC. On each 
cycle of the third unit the first two are stepped, 
yielding different variables for those two chan- 
nels on each cycle. Alternatively, each time the 
third steps to the two special inputs the corre- 
sponding multiplexer makes a full cycle. Tim- 
ing control of these operations is performed by 
parallel logic components (Figure 10). 

g. Asynchronous sampling. A completely 
asynchronous conversion system has been de- 
signed by one computer laboratory, in which 
the sequential program is interrupted only by 
comparators. Twenty analog problem variables 
are compared to reference values that are ad- 
justed by the digital computer when necessary. 
Each comparator calls for conversion of some 
group of variables (the same variables may be 
called for by different comparators) . When two 
or more comparator signals occur simultane- 
ously or during a conversion operation, two 
levels of priorities are set up by logic elements 
to determine what interrupts are to be made. 
While the system appears complex, it is accom- 
plished in a simple fashion in the parallel com- 
puter and makes good use of the sequential com- 
puter time. 

A longer list of useful variations in the con- 
trol and timing of sequential/parallel communi- 
cations can be compiled. For the most part, 
however, they should be explained in terms of 
the particular problem applications. 



Figure 10. Cascaded multiplexers, in this or other con- 
figurations, can aid in establishing multiple sampling 
rates for different groups of analog signals. 


Operating Times for Typical Mathematical 
Functions 

The repeated emphasis upon the efficient utili- 
zation of the sequential program time, high 
arithmetic speeds, and programming tricks to 
gain speed can be seen to be warranted when 
one examines the sequential operating times for 
several tsnpical mathematical functions, which, 
on the analog computer, would be executed con- 
tinuously and in parallel. 

a. The sum : a -j- b -f- c -f d 40 microsec. 

b. The expression : ax -f- by cz 

160 microsec. 

c. Sinwt or coswt 215 microsec. 

d. Square root of x* -)- y2 432 microsec. 

e. Generate z = f (x, y) , where 
two dimensional interpola- 
tion is required between 
functional values evenly 

placed in x and y : 0.5 to 1.5 millisec. 

f. Rotate a vector through 

three coordinate angles : 2 to 6 millisec. 

g. Perform one integration of a 
single derivative for a single 

time step : 0,1 to 1.3 millisec. 

A program of three first-order differential 
equations, where the derivatives are calculated 
from the functions above (items a through f) 
would not be a large program; and yet for a 
single step in time, the calculation time would 
be about 11.2 milliseconds. 

dx 

^ — x + y -l-z + f(x,y) 

^ = ax — by -1- cz 
^ = X* -f- y2 -f sinzt 

Allowing another millisecond or two for control 
and input/output instructions one can estimate 
the real-time speed performance of this pro- 
gram. The speed is best expressed in terms of 
the useful upper frequency (at full scale), in a 
problem variable, that can be represented by 
the computer. Although the example equations 
have no real meaning, the frequency limit for 
such a program is about 1^ cps. This does not 
seem like very fast performance for so few, 
simple equations. On the other hand, it is fast 
compared to frequencies of some of the vari- 





260 


PROCEEDINGS— -SPRING JOINT COMPUTER CONFERENCE, 1964 


ables in an aerospace simulation program for 
which digital precision is required. In any 
event, it should be clear for best utilization of 
the sequential computer, care should be taken to 
reduce operating time whenever possible. 

Sampling Rates 

When parallel and sequential machines are 
connected in a closed loop it is assumed that at 
least part of the task of the sequential program 
is to accept sequences of sampled values of con- 
tinuous inputs, calculate functions of these in- 
puts, which are then transferred out as 
sequences of numbers to be smoothed into con- 
tinuous signals. The digital computer attempts 
to appear as if it were a parallel computer, and 
as in a movie projection the effectiveness of 
this approximation is determined by the ratio 
of the frame or cycle rate to the rates of change 
of the signals communicated, and the time reso- 
lution of the person or computer receiving the 
information. Thirty frames per second will not 
catch the information in the trajectory of a 
humming bird. A higher rate is needed for an 
accurate recording of the flight. The human 
eye, however, cannot resolve time intervals less 
than 1/30 of a second. Thus an accurate record- 
ing can be transmitted to the eye only by a time 
scale change to slow motion. Fortunately, the 
parallel computer has a time resolving power 
sufficient to detect the shortest practical cycle 
time on the sequential computer, so the limiting 
factor in determining useful cycle frequencies 
is the rates of change of the variables that pass 
between the two computer sections. It is cus- 
tomary to speak of the bandwidth or spectrum 
of these signals — or more particularly the high- 
est useful frequency that must range over the 
full magnitude scale. The sampling rate or 
cycle rate must be selected in terms of the num- 
ber of discrete numbers or voltage samples 
necessary to represent this highest frequency at 
the desired accuracy. 

In sampled data theory the “sampling 
theorem” states that the sampling frequency 
must exceed two times the highest signal fre- 
quency if all the information in the original 
signal is to be retained.®^ That is, some number 
greater than two samples per cycle is necessary. 
Another important point comes from the 
theory : in sampling voltages at the input to the 
digital computer the rate must exceed twice the 


frequency of any signal present. If noise sig- 
nals are present that are higher in frequency 
than the desired signal and exceeding the 
sampling frequency it is possible for this noise 
to be reflected into the signal spectrum, thereby 
destroying useful information. This can be 
avoided with appropriate noise filters. If this 
were the only limitation it would be fortunate. 
However, too few samples per cycle makes it 
difficult for the sequential program to extrap- 
olate and predict what takes place in between 
samples. It is possible to do this, of course, by 
numerical means, but at cost in program time. 
Furthermore, numerical algorithms for integra- 
tion are sensitive to the ratio of sampling inter- 
val to the rates of changes of the variables, and 
the calculations may become unstable if too few 
samples per cycle are used (Figure 11). 

The most important criterion for determin- 
ing sampling rates appears in the conversion of 
the discrete data to continuous analog func- 
tions. Two sources of error affect the accuracy 
of the resultant continuous function. The first 
is the delay due to conversion and the sequential 
program itself. The output numbers are func- 
tions of input numbers that were sampled at an 
earlier time. Since the delay is unavoidable, but 
is predictable, numerical means are used to 
extrapolate the data to the time of actual digital- 
to-analog conversion. second error 
source is in the mechanism for conversion from 



Figure 11. A Sequential-Parallel Hybrid Computer: 
The noise and delays due to sampling and multiplexing 
on the one side and the magnitude and phase errors due 
to demultiplexing and smoothing on the other are domi- 
nant factors in determining the proper sampling fre- 
quency. 
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discrete to continuous form. A sequence of dis- 
crete values fed to a DAC results in a “stair- 
case” analog function. The desired output is a 
smooth curve passing through each data point 
(the left corner of each step). If the staircase 
is smoothed with an analog circuit the result is 
a smooth curve shifted in time i/^At, passing 
through the center of each step. The amplitude 
of this curve is attenuated from what it should 
be. This technique of smoothing is called “zero 
order” filtering. (See Figure 12.) The size of the 
errors is a function of the number of samples 
per cycle. At ten samples per cycle the magni- 
tude attenuation is about 1.1% and the phase 
shift is 18 degrees. At 30 samples per cycle the 
errors are 0.7 % and 6.5 degrees. 

A first order filter may be applied to the DAC 
output to reduce the errors. For special pur- 
poses higher order extrapolating filters are 
feasible. These filters are programmed from 
analog components. The first order filter extrap- 
olates from the last two discrete values to gen- 
erate intermediate values until the output volt- 
age is reset to the next discrete value from the 
DAC. The first order filter has a much improved 
phase characteristic but at low sample rates the 
magnitude is erroneously accentuated. The 


IDEAL 

OUTPUT 




Figure 12. The discrete to continuous signal conversion 
requires smoothing and hence yields only an approxi- 
mation to the ideal output from the sequential com- 
puter. The zero order conversion simply holds the 
output voltage at the last sampled value until the next 
arrives. When the “steps” are filtered out the result is 
shifted in time by the width of one half step. Thq first 
order scheme uses the past two converted values to 
predict the value between points, before “resetting” to 
the next value. 


3 



Q. 

Figure 13. Zero and first order conversion methods are 
compared, after filtering. The errors are functions of 
the number of samples per cycle of full scale signal 
frequency. A good rule of thumb for the zero order 
filter is that 50 samples per cycle, the errors are less 
than 0.1% and 3.6 degrees. 


error characteristics of the two filters are 
shown in Figure 13. A good rule of thumb for 
the simple zero order filter is that for 50 
samples per cycle the errors are less than 0.1% 
and 3.6 degrees. The above rule is convenient 
for estimating the required sample rate and 
hence the time available for the sequential pro- 
gram. If the variables converted from digital 
vary at a maximum frequency of 2 cps, then 
100 samples per second are needed, and 10 milli- 
seconds is the cycle time for the sequential pro- 
gram, for primary calculations and input/out- 
put operations. 

Dynamic Range of Dependent and Independent 
Variables 

The time resolving power of the analog com- 
puter was mentioned above in connection with 
an analogy to the resolution of the human eye. 
It is instructive to pursue this concept further. 
The time resolving power of a computer is 
measured by the shortest time interval that can 
be accounted for in a calculation. For all signals 
in an analog computation, the resolution is 
directly related to the bandwidth of the com- 
ponents ; however, the computer’s ability to re- 
spond to on-off signals and very short pulses, or 
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to discriminate between two closely spaced 
events is a closer description of time resolution. 
In a digital computer the absolute minimum 
resolution might be taken as the time to execute 
three instructions; however, within a hybrid 
system the resolution of the sequential program 
is either the sample interval discussed above, or 
at best, for asynchronous operation, the time 
for a complete interrupt program to respond to 
an event. In the parallel computer the time 
resolution is, of course, much greater because 
computing elements need not be time-shared. 
For relay circuits the resolution is about 1 milli- 
second, for electronic switching of analog sig- 
nals from 10 to 100 wseconds, and for parallel 
digital operations from 0.1 to 10 ^seconds. If 
these numbers are compared to the total length 
of a typical computer run, say 1.5 minutes, com- 
puter time resolution can be measured by a non- 
dimensional number: 

Parallel digital logic operations • 1: 10* 


Parallel digital arithmetic 
operations 1 : 10'' 

Parallel analog, electronic 
switching 1 : 5 X 10* 

Sequential program, minimum 
useful program 1 : 3 X 10® 

Parallel analog, relay switching 1 : 10® 

Sequential program, typical 
sampling 1 : 5 X 10® 


The digital computer is employed in a simu- 
lation where the dynamic range of dependent 
variables requires a wide dynamic range (re- 
ciprocal of resolution) in the computation. It is 
seen that resolution of the independent variable 
is traded for that of the dependent variables 
when a particular calculation is moved from 
analog to digital computer. Figure 14 shows 
these functions plotted against each other for 
different computers. The flat part, or “operat- 
ing range,” of the sequential computer plot is 
seen to be limited on one end by truncation error 
and the other by round-off error. This is to be 
interpreted as meaning that for a given set of 
mathematics short cuts and approximations 
may be used to improve the time resolution up 
to a point where the truncation error becomes 
serious. On the other end, special techniques 
may be used to reduce round-off error, including 
double precision operations, at the expense of 
time resolution. 



INDEPENDENT VUHA8LE (TNIE) RESOtUTION 

Figure 14. Certain performance characteristics of com- 
puters can be deduced from this comparison of the 
available resolution in the computer representation of 
dependent and independent variables. Notes: (1) The 
limits on the “Parallel Digital” curve are given as those 
of the EAI Hydac, 0.5 /isec. in time and 16 bits in 
magnitude; (2) the attenuation on the right hand end 
of the DDP-24 and 7094 curves corresponds to trunca- 
tion errors; (3) the slope at the left of these curves 
corresponds to reduction of round-off errors at the 
expense of speed including use of double precision 
methods (broken lines) ; (4) The "Real Time Fre- 
quency” scale refers to sign^al frequencies passed by the 
sequential program (DDP-24, 7094) assuming a maxi- 
mum sampling rate of 100 samples /cycle. 

When the digital computer curves are related 
to the frequency scale, at the top of Figure 14, 
a particular size program must be considered. 
For example, at the 2 cps point, six second order 
differential equations for a trajectory simula- 
tion could be calculated, for that point corre- 
sponds to a 10 millisecond (bottom scale) se- 
quential program time interval. At the 20 cps 
point, either very crude integration algorithms 
and approximations are used for the same prob- 
lem, or one would be considering a much smaller 
calculation. Moving to the left on the curve, 
more time is available either for more accurate 
calculation or for computation of more func- 
tions. The useful operating ranges for the dif- 
ferent techniques are evident from this figure, 
and this point of view should be useful to the 
problem analyst in considering hybrid simu- 
lation. 

FORMULATION OF THE SIMULATION 
MODEL AND PROGRAMMING 

Mathematical analysis of the behavior of 
physical processes and systems is a basic tool 
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for the design engineer, and the frequency of 
its use has been growing for many years at an 
increasing rate. At first, analysis was restricted 
to the smaller elements in a system, to linearized 
approximations, or to phenomena that can be 
isolated from interaction with its environment. 
For example, there have been many studies of 
noninteracting servo control loops, heat diffu- 
sion in devices of simple geometry, the “small 
signal behavior” of aircraft and their control 
systems, and batch and continuous chemical 
reactors of simplified geometry. Analysis starts 
from a consideration of the basic laws of physics 
as applied to the process at hand and proceeds 
to develop a mathematical model. The solution 
to the equations of this model for a range of the 
independent variable (s) constitutes a simula- 
tion of the process. The nature of the designer’s 
task and the very fact that the analysis has been 
limited to an element of a more complex system, 
requires that many such solutions be calculated. 
The simulation is performed numerous times 
over to determine the variations in the process 
behavior with changes in (a) internal design 
parameters of the process, and (b) environmen- 
tal conditions. Electronic computers, of both 
types, have aided immeasurably in reducing this 
task to a manageable one. The facility in ob- 
taining simulation results that computers have 
afforded the designer and analyst has acceler- 
ated the general acceptance of the analytical 
approach to difficult design problems. 

In addition to the wider use of simulation the 
successful correlation of experimental results 
with analytical predictions has built confidence 
in these techniques, which has led to the under- 
taking of simulations of more complex systems. 
It was once felt that a simulation model could be 
made so large that the analyst would have diffi- 
culty coping with the variables. Indeed this can 
happen, when poor engineering judgment leads 
to a model with many more variables than 
known conditions and assumptions. However, 
a complex model carefully built up from veri- 
fied models of subsystems may lead to valuable 
results attainable by no other means. Thus as 
analysis and simulation have yielded under- 
standing of the behavior of small systems, a 
natural process of escalation has led to simula- 
tion of systems of greater and greater com- 
plexity. The increased sophistication of simula- 
tion models has made the analyst even more 


dependent upon computers for effective control 
of the simulation and for interpretation of 
results. 

The rapid growth of analytical methods and 
the exploitation of computer technology have 
paralleled an even faster expansion in com- 
plexity of engineering systems. Aerospace 
systems, moon missions, space satellite labora- 
tories, nuclear reactor power systems, and 
automated chemical plants are examples of en- 
gineering systems that are so expensive and/or 
potentially hazardous that the design cannot be 
undertaken without computer simulation to pre- 
dict ultimate performance. It is no longer feasi- 
ble to restrict analysis to linearized or isolated 
subsystems in development of such systems. As 
technology continues to expand reliable methods 
must be found to predict, by simulation, the per- 
formance of total systems. For only with such 
analysis and prediction can decisions, involving 
capital investment as well as design features, be 
made. It is in this context that hybrid computa- 
tion can be seen to fulfill a growing need. 

One might well ask what are the implications 
of this esc^ation of complexity. If simulation 
models must necessarily grow larger, just how' 
does this affect the procedures of analysis, com- 
puter programming, computation, and interpre- 
tation of results? How are the hazards, which 
were of earlier concern, of becoming over- 
whelmed with useless data and meaningless 
computation be avoided? There is a pernicious 
theory about programming for very large digi- 
tal simulation, that says if two men can do the 
job in six months, four men will take twelve, 
and eight men would never complete it. How 
can the step from mathematical model to the 
first computer simulation run be held within 
bounds — ^to avoid inordinate investment in pro- 
gramming that may never work or may have to 
be scrapped for a better approach? How can 
the analyst or design engineer stay in touch 
with his model? 

Surely there are no answers that yield guar- 
anteed results. But these are serious questions 
and some direction is needed in order to eval- 
uate properly the true potential of advanced 
computer techniques. The implications in the 
field of hybrid simulation may be divided into 
three categories : 
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(a) Model building in programming 

(b) Software 

(c) Automation 

Model Building in Programming 

The analyst, design engineer, and program- 
mer of a large hybrid simulation must all (if 
they are more than one person) become in- 
volved in all phases of the simulation process. 
Responsibility cannot be divided up, as it often 
is at the digital “closed-shop” facility, between 
analyst and computer programmer. The hybrid 
computer laboratory must be operated, as many 
analog laboratories, on an open-shop basis with 
expert programming support available from 
the laboratory. The design engineer must have 
a genuine understanding of the computers to be 
used, even though he may not do the actual 
computer programming. Since the computer 
actually becomes the model of his system, he 
must know the limitations imposed by the 
machine as well as by the mathematics, and he 
must be able to communicate effectively with 
the computer. Moreover, during the construc- 
tion of the mathematical model the analyst must 
keep in mind the features of the parallel and 
sequential parts of the hybrid computer in order 
to achieve a proper partitioning of the mathe- 
matical model to suit the computer. 

Much attention has been given here to the 
relative speeds of computation inherent in the 
different computing techniques. It may be evi- 
dent at this point that the presence of a very 
wide range of signal frequencies in a system to 
be simulated is the one characteristic that most 
clearly indicates the need for a hybrid computer. 
As an example, consider the simulation of a long 
range flight of a space capsule. In “real time” 
the ijosition coordinates probably vary at 0.01 
cps over most of the range, and, at most, at 
3 cps during launch and re-entry. At the same 
time, pitch, roll, and yaw rates and thrust forces 
may reach 10 cps or more. Adaptive control 
functions and control surface forces may have 
transient frequencies as high as 50 cps; and a 
simulation of reaction jet control forces may 
require torque pulses as narrow as one milli- 
second. Since there is little or no damping in 
an orbital flight, these pulses have a long term 
effect, and accuracy in their representation is 
important. If an on-board predictive computer 


is included in the simulation, iterative calcula- 
tions on the analog computer may involve signal 
frequencies of 100 to 1000 cps. Thus, this simu- 
lation spans a frequency range of 10® as well as 
a dynamic range in some dependent variables of 
10® or 10* (Figures 14, 15, and 16). 

The following observations may be fairly ap- 
parent, but in considering division of a problem 
between computers it is well to note the types of 
mathematics for which each is best suited. The 
forte of the digital computer is the solution of 
algebraic equations. If the equations are ex- 
plicit, the calculation time is easily determined. 
Implicit equations often require a variable 
length of time, and if there are not too many of 
them they may be readily solved continuously 
on the analog computer. Numerical integration 
comes as a by-product of the computer’s power 
in solving algebraic problems. Time is the only 
penalty; If the high precision is not needed, the 
integration is better done by the analog com- 
puter, for the solution of ordinary differential 
equations is its strong feature. 

Evaluation of arbitrary functions is per- 
formed with ease by both computers, as well as 
by parallel digital components; however, if 



Fast Variables 


Figure 15. The relative speeds of computation inherent 
in the different computing techniques are important 
considerations in the planning of a computer simulation 
of a large complex system. The presence of a wide 
range of signal frequencies in a system to be simulated 
is the one characteristic that most clearly indicates the 
need for a hybrid computer. 




HYBRID COMPUTATION . . . WHAT IS IT? . . . WHO NEEDS IT? . . . 


265 




Figure 16. (Note: place figure horizontal with the 
word ‘‘time” on the left). A wide range of signal fre- 
quencies is suggested here: although not to scale, for 
a dynamic range of 10 ^ could not be illustrated con- 
veniently. Consider the simulation of a long range 
flight of a space capsule. The curves might represent 
(1. to r.) : position coordinates varying at 0.01 cps; 
deviation from a desired path, 0.1 cps; pitch, roll, or 
yaw rate, 1-5 cps; reaction jet control pulses, 1 ms. 
pulses at several hundred pulses per sec. ; thrust forces 
or control surface transients, 1-50 cps; iterative 
calculations for trajectory predictions, 100-1000 cps. 

speed is important, and if the data are func- 
tions of two or more variables, a digital pro- 
gram is the best choice. Simulation of non- 
analytic non-linear functions, such as limits, 
back-lash, dead-zone, stiction, and hysteresis 
again are amenable to both techniques but ana- 
log is probably more economical. Evaluation 
of trigonometric and hyperbolic functions also 
can be done both ways and the choice seems to 
depend on the particular problem. In this case 
there is a third choice, for there are techniques 
and equipments for executing these functions 
by parallel digital components.*’ 


Logic equations that must be evaluated con- 
tinually with respect to their relation to analog 
variables clearly must be programmed with 
parallel logic elements. On the other hand, de- 
cision and control functions that have to do 
with the occasional evaluation of states of the 
computer and errors signals and with sequenc- 
ing various sections of the total system through 
different modes and states of operation, may 
require both parallel and sequential operations. 

Hybrid simulation requiring solution of par- 
tial differential equations opens a whole new 
subject for discussion. Let it just be said that 
although there is very little practical experience 
in this field, it appears to offer one of the most 
promising areas of growth for hybrid simula- 
tion. The digital computer approach to the solu- 
tion of PDE is often limited by available com- 
puter time— particularly in simulation problems 
where it is desired to solve the problem many 
times for various conditions. The analog com- 
puter can solve some PDE problems very effi- 
ciently but it is often seriously limited by the 
necessity of large amounts of equipment for 
complex problems. Moreover, only with memory 
to store complete functions (either in a parallel 
digital system or a sequential computer) can 
certain kinds of boundary value problems be 
approached. The hybrid computer has the abil- 
ity to store boundary conditions as well as com- 
plete sets of intermediate solutions so that 
parallel computer programs can be used for 
speed, but then be time-shared over again for 
different parts of the space domain. Some very 
challenging problems and some promising pos- 
sibilities face the experimenter in this field. 

Returning now to the implications of the 
growth of complexity in simulation, the im- 
portant point in programming is that who- 
ever prepares the computer program must 
himself be a model builder and be familiar with 
the system to be simulated. The program should 
be designed, constructed, and checked out in 
much the same way that any other kind of 
model of a complex system would be built. The 
computer model should be put together from 
working models of subsystems. Each subsys- 
tem, or group thereof, is verified for correct 
performance in a linear or simplified mode be- 
fore connecting it to other parts of the model. 
At each point in the expansion of the model. 
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including the final one, at least one test case of 
a linear mode of operation should be checked 
against known or precalculated behavior. 

Computers can extend the analytical powers 
of man, but they cannot work magic. The com- 
puter is a storehouse of answers, but the burden 
is upon the analyst to ask meaningful questions 
if he is to receive useful answers. Only with a 
step by step process, asking questions of the 
computer at each step, can a valid and useful 
computer model be built for a large, complex 
system. 

Softuwre 

Standard programs and routines for digital 
computers, of general utility to programmers, 
known as “software,” are in such wide use that 
the production of software is virtually an in- 
dustry in its own right. “Automatic program- 
ming systems,” which make computer pro- 
gramming easy for the non-computer expert, 
are responsible for the almost universal accept- 
ance of digital computers in scientific research 
and development. 

The development of sophisticated software 
has made it possible to increase computer utili- 
zation, to gain wider use of computers with 
minimum training of personnel, and to reduce 
duplication of programming effort for programs 
of general utility. A total dependence upon 
automatic programming has the disadvantage 
of isolating the problem analyst, and even the 
programmer himself, from the computer. The 
analyst is restricted from communicating with 
his computer model while computation takes 
place. The programmer is limited in taking full 
advantage of the computer’s special features. 

On the other hand, in the analog computer 
field the reverse situation exists. No compa- 
rable “software” usage has shown up. There is 
little in the way of automatic programming 
and preserving of standard programs by analog 
computer programmers. However, another 
characteristic of simulation by analog computer 
is that the problem anayst is involved in build- 
ing the computer model and he maintains rap- 
port with his model during the simulation. 

The role that software must play in hybrid 
simulation of large complex systems is evident. 
“Hybrid software” must ease the programmer’s 


burden, as it does for the digital programmer, 
and at the same time it must bring the analyst 
closer to his model rather than isolating him 
from it. Hybrid software must include not only 
coding for the sequential computer but also 
interconnection diagrams and prewired patch 
panels for the parallel machine. The following 
types of software are needed to support growth 
of hybrid computation to meet the simulation 
needs of today. 

a. Compilers and Assemblers. Conventional 
compilers and intepreters have their useful 
place in hybrid computation, to aid in process- 
ing data prior to computation and processing 
results for interpretation. Automatic program- 
ming systems for hybrid computation may differ 
from conventional systems only in three ways : 
running time of the object program is mini- 
mized at the expense of compiling time ; actual 
running time for each program statement is 
precalculated or estimated to aid the program- 
mer with timing of the parallel/sequential inter- 
face; and while the programmer’s language is 
“problem oriented,” it is required to be machine- 
dependent. The programmer must be able to 
utilize special machine features and to program 
for control of all interface operations. 

b. Utility Library. In addition to the con- 
ventional utility routines for mathematical 
functions, format conversions, and input/output 
operations, the hybrid simulation library should 
expand with routines for specific transfer func- 
tions of useful subsystem models that have be- 
come standard and are used in larger models, 
e-g., a typical servo controller. Another type of 
example is a function generator program to any 
number of aerodynamic functions. A standard 
program for a complete aerodjmamic vehicle 
simulation is also feasible. 

c. Input/Output Routines. Direct on-line 
control of the computer model by the analyst is 
needed. In a convenient language it must be 
possible to experiment with time scales, param- 
eter values, and even to make substitution of 
mathematical algorithms (particularly integra- 
tion algorithms), without any penalty in run- 
ning time. This means that a complete symbol 
table and all definitions of parameters must be 
available to an executive routine that will ac- 
cept operator instructions to modify a particu- 
lar problem variable and proceed to calculate 
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changes in all the “machine variables” that are 
affected. The executor also permits interroga- 
tion of the state (and time history) of any 
problem variable, in engineering units. Upon 
operator command the model can be modified by 
changing the linkages between submodels or 
subroutines. It may, for instance, be desired to 
linearize part or all of the model for checking 
purposes, or to isolate or “freeze” certain parts 
of the model. In monitoring and adjusting the 
model the executor must not be limited to the 
sequential computer but should have full access 
to the parallel elements in the computer system. 

Automation 

One important aspect of hybrid computation, 
which is barely mentioned above, is the oppor- 
tunity for automating much of the routine parts 
of programming and check-out of the analog 
computer. It is evident that a necessary fea- 
ture of any hybrid computer system is the 
mechanization of, and sequential program con- 
trol of, as many of the manual operations on 
the analog computer as possible. This includes 
setting of potentiometers, switches, modes, time 
scales, recorders, and the selector and readout 
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some of the software functions mentioned 
above. Also it makes possible automatic set-up, 
testing, and diagnosis of machine and program 
faults. Some interesting diagnostic program- 
ming for such a hybrid computer was developed 
in 1959-60 at General Electric MSVD in Phila- 
delphia, 

A different type of programming automation 
is offered by the Apache system developed at 
Euratom, Ispra, Italy, for the IBM 7090 and 
PACE analog computers.i^ This is a digital pro- 
gram that translates a mathematical statement 
into detailed programming instructions for the 
analog computer. While Apache is not intended 
for hybrid computing the appropriateness of 
such a program should be apparent. 

One last important characteristic of hybrid 
simulation concerns the automation of the model 
building process itself. Simulation inherently 
involves trial and error experimentation. The 
elements of a model are verified; sensitivity to 
environment is explored ; and variation of per- 
formance due to parameter changes are evalu- 
ated. When a criterion for optimality can be 


specified, experiments are made to obtain opti- 
mum performance. The sequential computer 
is perfectly suited to the automation of these 
procedures. Between calculations the digital 
computer can evaluate the results, decide upon 
changes to the model or the data, and implement 
the changes. At the same time the analyst can 
monitor the progress of the simulation and 
interrupt the automatic process whenever hu- 
man judgment is required. 

CONCLUSIONS 

The main points developed in this paper may 
be listed simply. 

a. Hybrid computation is built upon the tech- 
nology of analog and of digital computers and 
is equally dependent upon the programming 
methods, software, and procedures of problem 
analysis that have been developed for each. 

b. A hybrid computer is a compatible system 
of parallel computing components, both digital 
and analog, and a stored program sequential 
machine. The hybrid programmer must be 
constantly aware of the relative timing of com- 
putEtionEl 8V6iits in the parallel and seQuential 
parts of the system. 

c. There is an ever growing need for simula- 
tion of very complex engineering systems. The 
process of analysis and building of a computer 
model for evaluation and prediction of behavior 
are a required step in many large development 
programs. The hybrid computer offers a means 
for many such simulations that would* be im- 
practical by other means. Hybrid computation 
is inherently a tool for very complex simula- 
tions rather than simple studies. 

BIBLIOGRAPHY 

1. Barnett, R. M., “NASA Ames Hybrid 
Computer Facilities and Their Application 
to Problems in Aeronautics,” International 
Symposium on Analogue and Digital Tech- 
nique Applied to Aeronautics, Liege, Bel- 
gium (September 1963). 

2. Bauer, W. F., and West, G. P., “A System 
for General Purpose Analog-Digital Com- 
putation,” JACM, Vol. 4, No. 1 (January 
1957), p. 12. 



268 


PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 


3. Baxter, D. C., and Milsum, J. H., “Re- 
quirements for a Hybrid Analog-Digital 
Computer,” National Research Council of 
Canada, Mechanical Engineering Report 
MK-7, Ottawa (October 1959), and Amer- 
ican Society of Mechanical Engineers, 
Paper No. 59-A-304. 

4. Bertram, J. E., “Effect of Quantization in 
Sampled-Feedback Systems,” U.S. Govern- 
ment Research Reports, Vol. 31 (January 
16, 1959) , p. 19 (A) , PB 133 341. 

5. Birkel, G., Jr., “Hybrid Computers for 
Process Control,” Communication and 
Electronics, No. 52 (January 1961), pp. 
726-734 ; discussion, p. 734. 

6. Blanyer, C. G., and Mori, H., “Analog, 
Digital, and Combined Analog-Digital 
Computers for Real-Time Simulation,” 
Proc. EJCC (December 1957), p. 104. 

7. Burns, A. J., and Kopp, R. E., “A Commu- 
nication Link Between an Analog and a 
Digital Computer (DATA-LINK),” Grum- 
man Aircraft Engineering Corp., Research 
Department, Research Report RE-142 
(October 1960), ASTI A, No. AD 244 913. 

8. , “Combined Analog-Digital Sim- 

ulation,” Proc. EJCC, Vol. 20 (December 
1961), pp. 114-123. 

9. Cameron, W. D., “Determination of Prob- 
ability Distribution Using Hybrid Com- 
puter Techniques,” International Sympo- 
sium on Analogue and Digital Techniqties 
Applied to Aeronautics, Liege, Belgium 
(September 1963). 

10. Connelly, M. E., “Simulation of Air- 
craft,” Servomechanisms Lab. Report 7591- 
R-1, M.I.T. (February 15, 1959). 

11. , “Analog-Digital Computers for 

Real Time Simulation,” M.I.T. Report 
DSR8215, Final Report, ESL-FR-110 
(June 1961). 

12. , “Real-Time Analog-Digital 

Computation, IRETEC, Col. EC-11, No. 1 
(February 1962), p. 31. 

13. Cox, F. B., and Lee, R. C., “A High-Speed 
Analog-Digital Computer for Simulation,” 
IRETEC, Vol. EC-8, No. 2 (June 1959), 

pp. 186-196. 


14. Debroux, a., and Green, C., and D’Hoop, 
H., “Apache — A Breakthrough in Analog 
Computing,” IRETEC, Vol. EC-11, No. 5 
(October 1962). 

15. Feucht, K.S "Diagnostic Programs for a 
Combined Analog-Digital System,” Pro- 
ceedings of the Combined Analog Digital 
Computer Systems Symposium, Philadel- 
phia (December 16-17, 1960). 

16. Gaines, W. M., and Fischer, P. P., “Ter- 
minology for Functional Characteristics of 
Analog-to-Digital Converters,” Control En- 
gineering, Vol. 8, No. 2 (February 1961), 
pp. 97, 8. 

17. Gelman, H. D., “Evaluation of an Inter- 
cept Trajectory Problem Solved on a Com- 
bined Analog-Digital System,” Proceedings 
of the Combined Analog Digital Computer 
Systems Symposium, Philadelphia (De- 
cember 16-17, 1960). 

18. Gelman, R., “Corrected Inputs — A Method 
for Improving Hybrid Simulation,” Proc. 
FJCC, Vol. 24 (November 1963). 

19. Greenstein, j. L., “Application of ADDA 
Verter System in Combined Analog-Digital 
Computer Operation,” Pacific General 
Meeting, AIEE (June 1956). 

20. Halbert, P. W., “Hybrid Simulation of an 
Aircraft Adaptive Control System,” Proc. 
FJCC, Vol. 24 (1963). 

21. Hartsfield, E., “Timing Considerations in 
a Combined Simulation System Employing 
a Serial Digital Computer,” Proceedings of 
the Combined Analog Digital Computer 
Systems Symposium, Philadelphia (De- 
cember 16-17, 1960). 

22. Korn, G. A., “The Impact of the Hybrid 
Analog-Digital Techniques on the Analog 
Computer Art,” Proc. IRE, Vol. 50, No. 5 
(May 1962), pp. 1077-1086. 

23. Landauer, j. P., “Simulation of Space Ve- 
hicle with Reaction Jet Control System” 
(1962), EAI Bulletin No. ALHC 62515. 

24. , “The Simulation of Transport 

Relay with the Hydac Computing System,” 
EAI Bulletin No. ALHC 63011. 

25. McLoed, j. H., and Leger, R. M., “Com- 
bined Analog and Digital Systems — ^Why, 
When, and How,” Instrument and Automa- 
tion, Vol. 30 (June 1957), pp. 1126-1130. 



HYBRID COMPUTATION . . . WHAT IS IT? . . . WHO NEEDS IT? . . . 269 


26. Mitchell, B. A., “A Hybrid Analog-Digi- 
tal Parameter Optimizer for ASTRAC-II,” 
Proc. SJCC, Vol. 25 (April 1964). 

27. Mitchell, J. M., and Ruhman, S., “The 
Trice — A High Speed Incremented Com- 
puter,” IRE Nat. Conv. Record (1958), 
Pt. 4, pp. 206-216. 

28. Miuea, T., Iwata, J., “Effects of Digital 
Execution Time in a Hybrid Computer,” 
Proc. FJCC, Vol. 24, (November 1963). 

29. Noronha, L., “An Integrated General Pur- 
pose Hybrid Computing System,” Interna- 
tional Symposium on Analogue and Digital 
Techniques Applied to Aeronautics, Liege, 
Belgium (September 1963). 

30. Palevsky, M., “Hybrid Analog-Digital 
Computing Systems,” Instruments and Au- 
tomation, Vol. 30 (October 1957), pp 1877- 
1880. 

31. , “The Digital Differential Ana- 

lyzer,” Computer Handbook, Edited by 
G. A. Korn and H. 0. Huskey, New York : 
McGraw-Hill (1961), Chapt. 19. 

32. Paskman, M., and Heid, J., “Combined 
Analog-Digital Computer System,” Pro- 
ceedings of the Combinedi Analog Digital 
Computer Systems Symposium, Philadel- 
phia (December 16-17, 1960). 

33. Shapiro, S., and Lapidus, L., “A Combined 
Analog-Digital Computer for Simulation 
of Chemical Processes,” Proceedings of the 
Combined Analog Digital Computer Sys- 
tems Symposium, Philadelphia (December 
16-17, 1960). 

34. Shileiko, a. V., “A Method for Selecting 
the Optimum Structure of a Digital Analog 
Computer,” Automation and Remote Con- 
trol (Avtomatika i Telemekhanika) , Vol. 
22, No. 1 (August 1961), (originally pub- 
lished January 1961), pp. 76-81. 


35. Skramstad, H. K., Ernst, A. A., and 
Nigeo, j. P., “An Analog-Digital Simula- 
tor for Design and Improvement of Man- 
Machine Systems,” Proc. EJCC (Decem- 
ber 1957) , p. 90. 

36. SUSSKIND, A. K., “Notes on Analog-Digital 
Conversion Techniques,” M.I.T., Technol- 
ogy Press (1957). 

37. VOLDER, J. E., “The Cordic Trigonometric 
Computing Technique,” IRETEC (Septem- 
ber 1959), pp. 330-334. 

38. West, G. P., “Computer Control Experi- 
ence Gained from Operation of a Large 
Combined Analog-Digital Computation 
System,” Proc. of Computers in Control 
Systems Conference, Atlantic City (Oc- 
tober 1957), p. 95. 

39. Wilson, A., “Use of Combined Analog- 
Digital System for Re-entry Vehicle Flight 
Simulation,” Proc. EJCC, Vol. 20 (Decem- 
ber 1961), pp. 105-113. 

40. WiTSENHAUSEN, H., “Hybrid Simulation of 
a Tubular Reactor” (1962), EAI Bulletin, 
No. ALHC 6252-25. 

41. , “Hybrid Techniques Applied to 

Optimization Problems,” Proc. SJCC, Vol. 
21 (May 1962). 

42. Wortzman, D., “Use of a Digital/ Analog 
Arithmetic Unit within a Digital Com- 
puter,” Proc. EJCC (December 1960), 
p. 269. 

43. Zetkow, G., and Fleisig, R. (Grummen 
Aircraft Corp.), “Dynamic Analysis of 
OAO Spacecraft Motion by Analog-Digital 
Simulation,” IRE Convention, Space Elec- 
tronic Session (March 1962). 




A HYBRID ANALOG-DIGITAL PARAMETER OPTIMIZER 

FOR ASTRAC II 

Baker A . Mitchell, Jr. 

Department of Electrical Engineering 
University of Arizona 
Tucson, Arizona 


INTRODUCTION 

This paper describes an optimizer designed 
to find system parameter combinations which 
optimize a functional, F, such as 

T 

F(a„...,a„) = f [yHt) +u2(t)] dt 

0 

where 

y(t) — y(t, ai, ail) 

U(t) = U(t, ai, . . . , a„) 

are state and control variables depending on the 
unknown parameters ai, . . . , an in accordance 
with the system equations 

yi. = fi(yi. .. .,yk;ui, ...Un,;t) 

The new optimizer is designed to work with 
a fast all-solid-state iterative differential an- 
alyzer (ASTRAC II) which is capable of pro- 
ducing complete solutions yi(t) and the corre- 
sponding values of the performance measure 
F(ai, an) for up to 1000 new parameter 
combinations per second.^ 

To simplify optimizer logic and memory re- 
quirements in problems involving many param- 
eters, we simultaneously implement random 
perturbations- on all parameters ak and step to 
the perturbed point whenever the perturbation 
yields an improvement in the performance 
measure F (ai, . . . , an) . 


Simple all-digital logic permits implementa- 
tion of different sequential optimization strate- 
gies, including correlation between random- 
perturbation vectors and step-size changes de- 
pending upon past successes and failures. The 
analog integrator/multipliers commonly used 
to set system parameters have be^n replaced by 
simple, reversible binary counters driving D/A 
converters^ for simplified design and improved 
reliability.^ The principle of the optimizer is 
shown in the block diagram of Fig. 1. 



Figure 1. Optimizer Block Diagram. 
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Figure 2a. Optimum Gradient Method. 



Figure 2b. Pure Random Perturbations. 





02 


Figure 2c. Correlated Random Perturbations. 


Figure 2. Typical Optimization Paths. 


Figure 2a shows the parameter-space path 
over which a conventional deterministic system 
would optimize a simple two parameter system. 
Starting with a trial set of parameters “ai the 
conventional optimization logic employs the re- 
sults of successive differential analyzer runs to 
obtain succeeding parameter values 

* tti ~ “1“ ’ Aai 

v/hich successively improve '■F= F (''aj, . . . , ‘‘a,,) . 


The most frequently used method employs n 
trial steps to compute approximate gradient 
components AF/Aa in each parameter direc- 
tion ; these gradient components are then stored 
and used to compute the optimal correction Aai 
for a working step, or for a series of working 
steps in the same direction.® 

Such deterministic methods require complex 
logic and storage. Although they may converge 
well for favorable performance functions F(ai, 

. . . , an), they may “hang up'’ on ridges or in 
canyons of the multidimensional landscape of 
the performance measure domain.^ Further- 
more, if the performance measure contains dis- 
continuities, nonlinearities, or large higher- 
order derivatives with respect to the param- 
eters, our information of past performances 
will be of little value in determining succeeding 
«teps; thus the step-size may have to be re- 
duced to such a degree that convergence to the 
optimum is extremely time-consuming. 

Figure 2b shows how a pure random- 
perturbation scheme might optimize the same 
function. Here, Aai may be positive, zero, or 
negative with equal probability; and the nomi- 
nal parameter point is moved as soon as the first 
improvement in the performance function 
occurs. No attempt is made to affect perturba- 
tions by past results or gradient methods. 

On the other hand, if perturbations are to be 
correlated with past successes or failures, then 
the optimization path might appear as shown 
in Fig. 2c. Such a scheme causes future incre- 
ments Aai to favor the direction in which past 
improvements in the performance function 
were made. Notice, however, that we still do 
not require computation of individual gradient 
components, as in deterministic gradient optimi- 
zation schemes. Hence, logic and memory re- 
quirements are reduced. 

Motivation for Random Search 

The basis for all direct computer methods of 
parameter optimization is the same: using a 
mathematical model or simulated system, we set 
the parameters to some trial values and com- 
pute the performance criterion. Then according 
to some rule, we reset the parameters to new 
values and again compute the performance cri- 
terion. This procedure is repeated until some 
desired degree of improvement is obtained. 
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Naturally it is hoped that the rule for adjusting 
the parameters will take maximum advantage 
of the knowledge gained from observing previ- 
ous trials, and by so doing achieve the optimum 
set of values for the parameters in the shortest 
possible time. Usually, however, this rule de- 
pends solely upon the knowledge gained from 
recent past trials and this is thought to be 
equivalent to using this knowledge to maximum 
advantage. If, however, the performance cri- 
terion contains discontinuities, nonlinearities, 
or large higher-order partial derivatives with 
respect to the parameters, our information of 
recent past behavior (actually a total or partial 
first derivative) may be of little value for the 
determination of successive steps; and if the 
step size is too large, this information from the 
preceding step (or from a short forward trial 
step) will be totally misleading. Thus, with con- 
ventional, deterministic perturbation such as 
the gradient method, one may be forced to re- 
duce the step size to such a degree that con- 
vergence to the absolute optimum is excessively 
time-consuming. For this reason it has been 
suggested that randomness be introduced into 
the search rule — perhaps in proportion to the 
expected severity of the discontinuities, non- 
linearities, etc., present in the cost function 
domain. 

In reality, it may be difficult to make any 
reliable prediction concerning the behavior of 
the performance function. Even with reason- 
ably well-behaved performance functions, it can 
be quite difficult to foresee ‘‘ridges,"’ “temporary 
plateaus,” “saddle-points,” and other features 
which render deterministic rules far from fool- 
proof. 

Reference 5 goes further into such motiva- 
tion for random-search methods (Figs. 3 and 
4). 

PRINCIPLES OF OPERATION 

Most of the optimization strategies proposed 
here can be based on the flow chart of Fig. 5. 
The operation common to all the various strate- 
gies consists of incrementing all parameters 
simultaneously by individual random incre- 
ments + Aa, — A a, or zero ; the common magni- 
tude Aa (step size) of these increments is sub- 
ject to a separate decision. The term success 


will be used to indicate that the incremented set, 
ai + A a, has yielded a more favorable value for 
the performance measure than was obtained 
with the unincremented set, ai. A failure will 
mean that the incremented set yielded a less 
favorable value for the performance measure, 
in which case the failing increment is sub- 
tracted from the parameter before a new incre- 
ment is added. Successive failures and successes 
can be counted and used to decide when an in- 
crease or decrease in the step size might be 


A02^4 AO 2 CI 2 



Figure 3 . The function defined represents a system com- 
posed by particular percentages of each of the four 
compounds shown at the corners. Contour lines are 
drawn in order to indicate values of a property on the 
system. The heavy lines are lines of discontinuity in 
slope .5 



Figure 4 . This sketch of a dynamic vibration absorber 
shows the amplitude of vibration F plotted over the fre- 
quency range w for values of the parameter ai. The 
effect of only one of the three parameters, ai, 02, 03, of 
the actual system could be drawn. A possible criterion 
for an optimum absorber is to require that the maxi- 
mum amplitude of vibration yielded by a particular set 
of parameter values, ai, 02, as, be minimum over the 
frequency range of interest.® 
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Figure 5. Flow Diagram for a Typical 
Optimization Routine. 


advantageous. A binary noise generator^ to- 
gether with digital correlation logic decides 
what the sign of each new increment will be. 
Thus, the overall effect of the perturbation 
scheme is an n-dimensional random walk. 

THE ASTRAC II SYSTEM 

The dynamic system and the performance 
criterion are to be simulated on the Arizona 
Statistical Repetitive Computer, ASTRAC II, 
although ASTRAC I served for preliminary 
studies. ASTRAC II is a ±10 volt, all-solid- 
state iterative differential analyzer capable of 
iteration rates of 1 Kc. as well as real time com- 
putation. The first 20-amplifier section of 
ASTRAC II is to be completed in the fall of 
1964. 

The analog section has a large conventionally 
appearing patchbay. 20 Me. transistorized am- 
plifiers mounted in shielded cans plug directly 
into the rear of the patchbay without any in- 
tervening wiring. The analog section will com- 
prise sample-hold memory pairs, comparators, 
analog switches, switched integrators, diode 
quarter-square multiplifiers, and diode function 
generators. 

Timing and logical control is furnished by the 
digital section, which provides timing pulses, 


integrator RESET pulses, and sampling pulses. 
The digital section has its own patchbay with 
removable patchboards for implementing vari- 
ous logic functions. Patchable gates and flip- 
flops are used in conjunction with the prewired 
timing and RESET circuits. In view of the 
amount of logic involved in the optimizer, how- 
ever, it was thought best to build it as a separate 
digital section with its own removable patch- 
boards, devoted to this purpose. The complete 
ASTRAC II optimizer will not only implement 
the sequential random search optimization de- 
scribed here, but will permit comparison with 
deterministic optimization schemes. 

Optimizer Logic 

The digital logic of the optimizer is sub- 
divided into basic functional units whose in- 
puts, outputs, and control points are wired to 
its patchbay (Fig. 6) . With a different prepatch 
panel, these components are also available for 
other uses besides optimization. In particular, 
the parameter-setting circuits will also serve 
for experiments with deterministic optimiza- 
tion schemes. 



Figure 6. Photograph of the Optimizer. A D/A 
multiplier card is shown at bottom left. 
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The functional units are built of commercial 
plug-in logic cards interconnected on racks with 
wire-wrap terminations for ease of modification 
and expansion. 

The resistor networks and switches compris- 
ing the D/A multipliers are mounted in shielded 
plug-in cans adjacent to the operational ampli- 
fiers behind the analog patchbay. Shielded digi- 
tal control lines connect each D/ A multiplier to 
the optimizer patchbay. 

Hybrid Analog-Digital Noise Generator 

ASTRAC II will employ a new noise genera- 
tori® producing pseudo-random maximum- 
length shift-register sequences at any desired 
clock rate up to 4 Me. We may obtain either a 
single pseudo-random sequence repeating after 
33 million bits, or four uncorrelated sequences 
one-fourth as long. 

Success-Failure Indicator 

Essentially, the function of this circuit (Fig. 
7) is to compare the value of the best perform- 
ance measure, ’"F, obtained to date, with the per- 
formance measure just yielded by the last com- 



6 


7 


OPTIMIZER FOR ASTRAC II 275 


puter run, T. If the last run was a failure, 
(e.g., "^F < ^F for maximization), then U 3 and 

S remain zero ; and ^F is still held as being the 
best value. If, however, the last computer run 
was a success (’’F > ^F), then U 3 becomes “1” 
which causes the second sample-hold to take on 
the value just obtained, ■’F, as now being the 
new optimum. The pulse U 3 also causes fiip-fiop 
7 to assume the “1” state (s-o) which indicates 
to the digital optimization logic that a success 
was obtained with the last set of incremented 
parameters. 

If it is known that F will always be mon- 
otonically increasing during the latter part of 
the COMPUTE period. Switch 3 and U 2 need 
not be used. 

Master Clock 

The Master Clock provides all timing pulses 
throughout the digital optimization routine. It 
consists of a four bit Gray-code counter driven 
by a 1 MC. pulse, Ci, from the differential- 
analyzer digital control unit. Ci is gated to the 
counter during the differential-analyzer RESET 
period. With the exception of the Success- 
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control of the differential-analyzer, no optimiza- 
tion logic is performed during the COMPUTE 
period of the differential analyzer. 


Barometer Setting 



Figure 9. Parameter Setting. 



The sequence of the timing pulses from the 
Master Clock is shown in Fig. 8. S', as shown, 
occurs only after failures, i.e., S = 1 ; if S = 
0, S' = 1 throughout the RESET period. A suc- 
cess or failure also causes ns or np to occur 
(Fig. 15). 

PARAMETER SETTING 

The flexibility needed to implement a variety 
of different optimization-logic schemes while 
maintaining simplicity, reliability, and low cost 
is achieved by using a unique method of param- 
eter setting. 

Binary -Counter Operation 

Referring to Fig. 9, the binary up-down 
counter increments whenever a pulse appears 
on the “F’ line. The right-left shift register 
contains zeros in all except one of its stages, 
and the position of this *'1'’ selects the stage of 
the binary counter which is to receive the “F" 
pulse. By controlling the D/A multipliers, the 
counter has then increased or decreased ai by an 



Figure 10a. U-L-D Selector Circuit. 


Figure 10b. U-L-D Memory. 
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increment Aai ; the magnitude A is determined 
by the position of the in the shift register, 
and the sign (+ or — ) is determined by the 
logic level on the UD line (1 count up, 0 = 
count down) . 

U-L-D Digital Logic 

The U-L-D (UP-LOCK-DOWN) Logic is 
composed of two sections: a central U-L-D 
Selector Circuit (Fig. 10a), and a U-L-D Mem- 


ory associated with the binary counter for each 
parameter (Fig. 10b). 

The selector circuit accepts two uncorrelated 
random bits ^Ni, ^N2 obtained from the 
ASTRAC II noise generator. Depending on the 
interconnections of the selector-circuit gates Gi, 
Gm, G:^, G4 to gates U, L, D, the gate outputs Uo, 
Lo, Do will have different joint probability dis- 
tributions as shown in Table I. 
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The gate outputs, Uo, Lo, Do, determine the 
states of the dc set and reset level controls on 
the UP-DOWN flip-flops and the LOCK flip- 
flops of all U-L-D Memories (one for each 
parameter) simultaneously. The pulse Mi from 
the Master Clock now first sets the U-L-D flip- 
flops of the first parameter to their proper 
states as determined by the first set of random 
bits iNi, 1 N 2 . 

Next, the noise generators are pulsed again 
by the Master Clock (pulses NN) producing 
two new random bits, “Ni, “Ni>, which determine 
a new set of states for the dc set and reset level 
controls of the U-L-D Memory flip-flops. Now, 
the pulse M 2 to the U-L-D Memory of the second 
parameter sets its UP-DOWN and LOCK flip- 
flops in accordance with -Ni, "N 2 . The process 
repeats for the remaining parameters. 

U-L-D Memory 

Note that we shall require two decisions for 
each parameter. The LOCK circuitry decides 
whether is to be incremented or not incre- 
mented (“locked'')* The UP-DOWN circuits 
decide the direction (up or down) of the incre- 
ments Attk, if any. We shall consider the UP- 
DOWN circuits first. 

If an UP-DOWN flip-flop is set (reset), a 
logical 1 (0) will appear on the “UD" line to 
the binary counter, if S' — 1. We now come to 
the LOCK decision. If the LOCK flip-flop is set 
or reset the incrementing commands “C" and 
“BC" from the Master Clock will or will not 
carry through the gates on “I" to increment the 
counters in the direction dictated by “UD" 

Correlation Circuit 

This circuit can be patched so as to introduce 
correlation between successive random pertur- 
bations according to some strategy selected to 
speed optimization. Suppose that the last set of 
increments succeeded in improving the per- 
formance function. Assuming that the perform- 
ance function is fairly well-behaved, the great- 
est chance for another success lies in weighting 
each parameter so that it ^ will probably incre- 
ment in the same direction as in the previous 
trial (strong positive correlation) . By the same 
reasoning, after a failure a strong negative cor- 
relation might be introduced. After either a 


failure or a success, if a particular parameter 
had been locked (i.e., its last increment was 
zero) then a probability function giving equal 
weight to all three states might be desirable for 
the next trial. 

Referring to Fig. 11 the correlation circuit is 
patched as desired and senses the sign of the 
last increment in each U-L-D Memory, starting 
with the first parameter. If Aak is positive, the 
RC line changes the selector gating so as to* in- 
crease the chance of a positive Aa^ for the next 
run if the last run was successful. The reverse 
can take place if the last run was a failure, de- 
pending upon the correlation control “CC". 

If, however, Aak was zero, as indicated by the 
state of the LOCK flip-flop, then the correlation 
line RC is turned on or off with equal probabil- 
ity by a random-noise input. This entire process 
is repeated for each parameter in turn, as they 
are sequenced by the pulses Pi from the Master 
Clock. 



^ S negdtive conrtlotion ofttr failure 

) S poeifivc corre lotion offer euccete 

) I negative and positive correlation 
VO no eorreiotion 


Figure 11. Correlation Circuit, 
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OPTIMIZATION STRATEGIES 

1, Pure Random Perturbations 

Pure random perturbations are achieved if 
we preset the '‘1’’ in the shift register to gate 
the count pulse into the binary counter stage 
yielding the desired step distance, e.g., V2> 

. . . , 1/128. The noise generator is connected 
directly to the SET and RESET level controls 
of the Up-Down flip-flop of each parameter. 
During the analog RESET period, the noise 
generator is pulsed once for each parameter 
shortly before the sequenced pulse to the proper 
Up-Down flip-flop arrives. Thus, the Up-Down 
flip-flops sequentially assume the successive 
states of the noise source. The count pulse, C, 
is then fed to all parameters simultaneously. 
Hence, all parameters increment by the same 
magnitude but with random signs during each 
analog RESET period, executing a random walk 
in the perturbation scheme. 

With pulses S, S' and BC added to the U-L-D 
Memory, the set of increments used in the pre- 
ceding run can be subtracted from the counters 
prior to the addition of increments for the next 
run. Thus, after a failure in an optimization 
run, the counters can be returned to the state 
which yielded the last success, before making 
another search. 

2, Random Walk with Reflecting Barriers 

This scheme is exactly the same as the pure 
random walk with one additional operation. 
Prior to assigning a new set of signs to the 
Up-Down flip-flops, a pulse to signify that a 
parameter has reached a barrier — ^possibly from 
a comparator in the analog system — can be 
gated to the U-L-D logic of the parameter to be 
reflected, causing its Up-Down flip-flop to com- 
plement and, later, to ignore the sign that is 
assigned to it by the noise source. In this man- 
ner, the parameter will have been reflected to its 
old position held two analog COMPUTE periods 
previously. 

S. Random Walk with Varied Step Size 

This scheme also contains only one addition 
to the pure random walk. When it is desired to 
increase or decrease the step size — possibly 
after a certain number of successive failures 


have been counted — one has only to insert the 
increase or decrease command, e.g., the counter 
output, into the shift-right or shift-left input 
of the shift register. This gates the count pulse 
either to the next higher or next lower param- 
eter counter stage. The noise generator then 
assigns polarities to the Up-Down flip-flops, and 
the parameters are incremented by the new step 
size. 

For this purpose, the optimizer contains two 
success-failure counters. 

i. Correlated or Biased Random Walk 

Since two independent noise sources are 
available, one can arrange P(NiN 2 ) — 1/4, 

P(NiN2) = 1/4, P(NiN 2 ) = 1/4, P(NiN 2 ) = 1 / 4 . 
These can combine to give P(X) = i/^, or 

P(Y) = where X = N 1 N 2 + N 1 N 2 , Y 

N 1 N 2 +N 1 N 2 +N 1 N 2 . There are three possible 
states for the U-L-D Memory : 1. Up ; 2. Down ; 

3. Lock. If a failure occurred in the Up state, 
after subtraction of the failing set of incre- 
ments, it would be desirable to assign weighted 
probabilities which would be more likely to re- 
sult in the next state being Down. The proba- 
bility distributions which can be prepatched are 
listed in Table I. Likewise, if a success occurs 
in the Down state, the next assignment of states 
could be weighted on the same basis. 

This strategy can be combined with step size 
variations implemented with the aid of the 
success/failure counters. 

TESTS 

The following tests were carried out using 
the optimizer in conjunction with ASTRAC I, 
a ±100 volt, 100 run/sec. iterative differential 
analyzer. 

In order to generate performance measures 
with the precise characteristics desired, only 
algebraic functions F(ai, 02) were used for 
quantitative evaluation of the various optimiza- 
tion strategies. For practical use in optimizing 
dynamic systems, the optimizer setup would be 
entirely unchanged, except that the function 
F (ai, 02 ) would be generated as samples at the 
end of a differential-analyzer run. 
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Also, the relatively slow repetition rate of 10 
runs/ sec. was employed only for recording pur- 
poses. 


were allowed to vary over ±100 volts. Conver- 
gence was considered to be satisfactory when F 
was within 0.5 volt of the maximum. 


Functions Optimized 

Initial tests were made with two different 
types of performance functions. 

Minimization of functions of the general type 


F (ai, ttii) 


(ai— ki)^ 

a 



was carried out. The parameters ai and ao were 
allowed to vary over ±100 volts, and conver- 
gence of ai, a* was considered to be satisfactory 
when they were within 0.78 volts of their values 
which optimized F. 


The next experiment involved maximization 
of a function F(ai, a^) exhibiting sharp ridges 
formed by the intersection of three planes as 
shown in Fig. 12a. The equation for this func- 
tion is 


ai 

F (ai, a2) == Max -|- 2ai 100 

a 2 (optimum) =66.6; a, (optimum) =44.4 (8) 

Extremely sharp ridges or intersections were 
simulated by using accurate high-speed selector 
circuits (Fig. 12b). The parameters ai and a 2 



Strategies Employed 

Parameters were allowed to step only after 
successful runs, i.e., unsuccessful parameter 
changes were subtracted out. 

Various types and degrees of correlation be- 
tween successive parameter runs were tried. 
(See Correlated and Biased Random Walk.) 

The step size. A, was increased by a factor of 
2 after Ns consecutive successes and decreased 
after Np consecutive failures; Ns and Np were 
varied. 


ISOK SOK 



Figure 12a. Ridge. 


Figure 12b. Ridge Simulation. 
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Results 

For the paraboloids (Equation 6), several 
thousand optimization trials were recorded us- 
ing many variations within the general class of 
strategies outlined above. 

As the eccentricity of the contours of con- 
stant F was increased, convergence was slowed 
somewhat but not radically. In no cas<^ were 
more than 70 runs required for convergence. 

For the case, a = b = 1, using no correlation 
in the perturbation scheme, resulted in an aver- 
age of 28 runs required for convergence with a 
standard deviation of 9 runs. 

For the case, a = b = 1 using strong positive 
correlation with successes and strong negative 
correlation after failures, CC = 1 ; the average 
trial converged in 16 runs. 

The most favorable step-size variation 
strategy was to increase A after 2 successes 
and decrease A after 2 failures. 

The initial points of °ai, were always kept 
at the extreme of their ranges and only a slight 
decrease in convergence-time was noted as the 
initial point was placed closer to the optimum. 

The particular ridge (Equation 7 ; Fig. 12a) 
to be discussed here is one expressly designed 
to present the greatest difficulty to the opti- 
mizer. Referring to Fig. 12a, it is seen that 
improvement is quite difficult if the parameter 
point falls close to the ridge. Had the sides not 



sloped so steeply, or had the ridge been oriented 
differently with respect to the parameter axis, 
the optimizer would have found convergence 
more natural. 

Fig. 13a shows the optimization of this func- 
tion without step size changes (path A in Fig. 
12c). Note that the parameter point followed 
the direction of greatest improvement until it 
reached the ridge, then both parameters were 
forced to zigzag up the ridge to the peak. The 
step size was constant at 1.56 volts, and 1800 
runs were necessary to converge. In Fig. 13b, 
the step size was again held constant; but the 
binary counter for complemented at the be- 



Figure 13a. Optimization of Ridge. 



Figure 12c. Optimization Paths. 


Figure 13b. Optimization of Ridge. 
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ginning of both trials No. 1 and No. 2. Thus, it 
was not necessary to climb the longest ridge. 
The paths converged more quickly up the 
shorter ridge (path B in Fig. 12c). This favor- 
able possibility is not present in ordinary 
gradient techniques. Path A in Fig. 12c is most 
nearly like a path resulting from conventional 
gradient techniques. 

Fig. 14a shows a correlated random search 
with step-size variations employed in the strate- 
gies. The optimizer is not confined to merely' 
zigzagging slowly up the ridge but can traverse 
great distances while searching for improve- 
ments. Correlation improved convergence time 
somewhat, but only when weak positive corre- 
lation after successes was used. For 45 trials 
using this correlation, the average time to con- 
verge was 73 runs if the step-size was decreased 
after 2 failures. Other types of correlation 
slowed convergence by 10-20 per cent. 

For comparison note that an optimizer going 
directly in a straight line to the optimum point 
would require 106 steps if a fixed step size of 
1.56 volts were used. 

Stopping Conditions 

When the optimum value of F is unknown, 
defining a stopping condition is subject to sev- 
eral factors. One such factor arises when we 
are not certain that only one peak exists in the 
domain of F. In this case, we would want to 
cycle the step size through its decrease-increase 
scheme several times before stopping. Then a 
rescaling of the simulation might be desirable 
in case the initial ranges of the parameters was 
chosen too large. 

Conclusions 

While experience with this optimizer is still 
quite limited, it appears that its performance 
can compare quite favorably with conventional 
techniques. Better conclusions can be made 
when this system is expanded to accommodate 
four parameters and the logic is enlarged to 
permit implementation of the conventional de- 
terministic schemes— permitting a direct com- 
parison between random and deterministic 
methods optimizing the same function. 

Presently, however, the random techniques 
seem well able to handle cases in which the 
performance measure is not well-behaved (Fig. 
3 and 4). 
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Figure 14b. Employing Step Size Variations. 

ACKNOWLEDGEMENT 

The project described in this report is part of 
a hybrid analog-digital computer study directed 
by Professor G. A. Korn. The writer is grateful 
to the Office of Aerospace Research, Informa- 
tion Research Division, Air Force Office of 



A HYBRID ANALOG-DIGITAL PARAMETER OPTIMIZER FOR ASTRAC II 283 


Scientific Research and to the Office of Space 
Sciences, National Aeronautics and Space Ad- 
ministration for their continuing support of this 
study under joint grant AF-AFOSR-89-63 ; and 
to Professors L. W. Matsch, Dean of Engineer- 
ing, and H. E. Stewart, Head, Department of 
Electrical Engineering, for their encourage- 
ment and contribution of University facilities. 

REFERENCES 

1. Korn, G. A., and T. M. Korn, Electronic 
Analog and Hybrid Computers, McGraw- 
Hill, N. Y., 1964 (in print) . 

2. Munson, J. K., and A. 1. Rubin, “Optimi- 
zation by Random Search on the Analog 
Computer,” IRE Trans. PGEC, June, 1959. 

3. Wait, J. V., and B. A. Mitchell, “A Sim- 
ple Solid-State Digital-to-Analog Con- 
verter for Hybrid Systems,” ACL Memo- 
randum No. 61, University of Arizona, 
1963. 

4. Mitchell, B. A., “A Hybrid Analog-Digi- 
tal One Parameter Optimizer,” Ann. AICA, 
January, 1964. 

5. Karnopp, R., Ph.D. Thesis, Massachusetts 
Inst, of Technology, 1963. 

6. Brooks, S. H., “A Comparison of Maxi- 
mum-seeking Methods,” Operations Re- 
search, July-August, 1959. 



7. Hampton, R., G. A. Korn, and B. A. 
Mitchell, “Hybrid Analog-Digital Ran- 
dom-Noise Generation,” IEEE Trans. 
PGEC, August, 1963. 

8. Witsenhausen, H. S., “Hybrid Techniques 
Applied to Optimization Problems,” Proc. 
SJCC, 1962. 

9. Howell, M., “Automatic Parameter Op- 
timization as Applied to Transducer De- 
sign,” Proc. SJCC, 1963. 

10. Hampton, R., “Hybrid Analog-digital 
Pseudo-random Noise Generator,” Proc. 
SJCC, 1964. 

APPENDIX I 

The following units are contained in the 
optimizer digital and/or are wired to the opti- 
mizer patchbay. On the drawings, a small circle 
on the end of a wire indicates a patchbay termi- 
nation. 

1. 3 4-bit Gray code counters. One of these 
counters has a free-running multivibrator 
for the input. This counter is used exclu- 
sively for sequencing operations through- 
out the logic scheme subroutine, e.g., the 
analog RESET period. The remaining two 
Gray code counters have their flip-flop 
outputs adjacent to two gates on the 
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Figure 15. Gray Code Counter Circuit, 
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patchboard. These may be patched to yield 
outputs after any preset number of input 
pulses. These counters may reset them- 
selves or may be reset externally (Fig. 
15.) 

2. 1 8-bit right-left shift register. The DC 
set and reset lines, along with the set and 
reset outputs, appear on the patchboard 
for parallel drop-in on command. Also the 
set and reset level controls for the first 
and last flip-flops are on the patchboard 
thus permitting operation as a ring coun- 
ter. Four inputs are provided for shifting 
in either direction. (Fig. 17b.) 

3. 4 8-bit up-down binary counters. The DC 
set and reset lines, along with the set and 
reset outputs, appear on the patchboard 
for parallel loading. A gated complement 
input for each flip-flop is brought out, 
allowing the counter to be stepped at any 
stage. A flip-flop with associated gates 
permits both pulse and level control of the 
up-down lines. (Fig. 17a.) 
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Figure 16. D/A Multiplier and Switch. 



Figure 17a. UP-DOWN Binary Counter. 
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Figure 17b. Right-Left Shift Register. 
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4. 4 8-bit D/A multipliers. The digital con- 
trol lines for the D/A multipliers 
(mounted in shielded cans behind the 
analog patchbay) are wired to the digital 
patchbay. Each set of lines is associated 
with one of the up-down binary counters 
(Fig 16.) 

5. 2 pseudo-random discrete-interval, binary 
noise generators. The noise generator for 
ASTRAC II is a 25-stage shift register 
with modulo 2 adders in feedback, gen- 
erating a maximum length of 3X10^ ran- 
dom bits. This can be divided into two 
noise generators, each having a 1.5X10^ 
random bit sequence. The noise generator 
shifts out a random bit when a pulse is 
applied to the shift input. These inputs, 
outputs, and complement outputs appear 
on the optimizer patchbay as well as on 
the noise generator patchbay. 



Figure 18 . Digital Logic Block Diagram. 
















A HYBRID ANALOG-DIGITAL PSEUDO-RANDOM 
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I. INTRODUCTION 

Analog and hybrid analog-digital computers 
used for random-process and statistical studies 
require noise sources capable of generating 
random signals whose amplitude distribution, 
d-c unbalance, spectrum, and RMS level are 
specified within the computer accuracy limits. 
Noise samples must not be correlated for time 
delays exceeding one-ten-thousandth to one- 
thousandth of a computer run. 

Noise derived from thyratrons in magnetic 
fields, noisy diodes and photomultiplier tubes 
coated with radioactive paint changes with 
various environmental conditions. Thus, elabo- 
rate gain-control, sampling and/or filter cir- 
cuitry is required in order to meet computer 
specifications. By using the random signal from 
such sources to trigger a flip-flop or threshold- 
sensing comparator, flat-spectrum binary noise 
with =bA volt output levels is obtained. Such 
binary noise can easily be Altered to produce 
Gaussian signals and is very useful in its own 
right : it can be used to drive analog or digital 
switching systems used to simulate random 
events, machine failures, etc., and lends itself 
to direct correlation with other signals without 
the use of analog multipliers. By precision 
clamping, the RMS level of binary poise can be 
closely controlled, but the non-stationarity of 
the circuits used to obtain electrical noise, even 


from stationary mechanisms such as a radio- 
active source, still create problems and ex- 
pense.i For example, the 80 Kc random-tele- 
graph wave generator developed at The Univer- 
sity of Arizona’s Hybrid Computer Laboratory 
and described in Ref. 2 required a fairly sophis- 
ticated and not completely satisfactory count- 
rate control loop. 

In the design of the University of Arizona’s 
new ASTRAC II iterative differential analyzer, 
which is to be capable of taking statistics over 
1,000 random-input computer runs per second, 
it was decided to abandon analog noise genera- 
tion completely. Instead, the machine will em- 
ploy a digital shift-register sequence generator 
that can produce binary pseudo-random noise 
sequences at any clock rate between zero and 
4 Me. This permits exact time scale changes or 
intermittent operation. The noise generator 
produces digital computer random numbers as 
well as analog noise. The digital numbers are 
easily stored or transmitted and may be used to 
produce binomially or normally distributed 
random analog coefficients. Digital multiplex- 
ing yields multiple uncorrelated noise signals 
from a single shift-register. These noise sig- 
nals are independent of any physical quantity 
except for the output clamping levels. Also the 
flat spectrum binary output permits direct logi- 
cal or analog multiplication. The length of the 
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pseudo-random output sequence is 33, 554, 431 
bits which is equivalent to several thousand 
computer runs. 

II. BINARY PSEUDO-RANDOM NOISE 
GENERATION WITH SHIFT-REGISTERS 

2.1 Pseudo-Random Binary Sequences 

Binary pseudo-random noise, as the term is 
used in this paper, differs in two important re- 
spects from purely random binary signals : 

1. A truly random binary signal is non- 
periodic, while a pseudo-random sequence 
repeats itself after some suitably long se- 
quence. 

2. In many random binary processes (e.g., a 
random telegraph wave) the transition 
from the “1” state to the “0” state (or 
conversely) can occur at any time, and 
the state at any instant of time is inde- 
pendent of the state at any other instant 
of time. In the pseudo-random binary 
process the binary level transitions can 
occur only at specific clock pulse times, 
separated by intervals during which the 
binary state is fixed. In this case the state 
during the fixed time interval is independ- 
ent of the state in neighboring time inter- 
vals. 

A periodic binary sequence will be classified 
as a pseudo-random sequence if it satisfies the 
following conditions : 

1. In each period the number of “I’s” must 
not exceed the number of “O’s” by more 
than one (or conversely) . 

2. In each period there must be twice as 
many sequences of “I’s” or “O’s” of length 
n as those of length n -f 1. 

3. The autocorrelation function must have 
the form shown in Fig. 3b, i.e., peaked in 
the middle (t = 0) and tapering off 
rapidly at both ends.* 

Such a binary sequence with a sufficiently long 
period — longer than a desired series of com- 
puter runs — can be used essentially like true 
random noise for computing purposes. 

Periodic binary sequences may be obtained 
from a digital shift-register with modulo-2 
adder feedback (Fig. 1). Module-2 addition 


generates the sum (A -f- B)Mod: 2 of any two 
binary inputs A and B according to the follow- 
ing table : 



Figure 1. 


A 

B 

(A -)- B)iiod; 2 

1 

0 

1 

0 

1 

1 

1 

1 

0 

0 

0 

0 


As can be seen from the above table, mcdulc-2' 
addition can be implemented logically with an 
EXCLUSIVE — OR circuit. The shift-register 
consists of cascaded flip-flops driven at the de- 
sired rate by external clock pulses (shift 
pulses). The outputs of certain flip-flops are 
added modulo-2 and their sum is then fed back 
to the first stage of the shift-register. 

Figure 2 illustrates three shift-register peri- 
odic sequence generators and their correspond- 
ing sequences. Each column of “I’s” and “O’s” 
corresponds to the successive states of each 
stage of the register. It should be noted that 
No. 1 and No. 2 outputs are periodic every 
2* — 1 = 7 bits, while No. 3 is periodic every 
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I 0 I 


I 0 I 


I 0 I 
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0 0 0 
0 0 0 


0 0 0 0 0 0 
0 0 0 0 0 0 


0 0 0 


0 0 0 0 0 0 


Fignire 2. 3-Stage Periodic Binary Sequence Generators. 


3 bits. In each case the periodic series is com- 
pletely determined by the initial state of the 
flip-flops and by the feedback connections. The 
resulting sequences for an all-zero initial condi- 
tion are also shown. 

2.2 Maxi'murri Length Sequences 

It is easy to show that the maximum length 
of any sequence produced by a shift-register is 
2" — 1, where n is the number of flip-flop stages. 
For an n-stage generator, there are 2“ possible 
states. The all-zero state can be ruled out as an 
admissible condition, since with modulo-2 addi- 
tion, each succeeding state would also be all 
zero as illustrated in Fig. 2. Therefore, the 
sequence is periodic with a maximum of 2“ — 1 
bits. Golomb has proven that every maximum 
length shift-register sequence satisfies the three 
conditions required for a pseudo-random se- 
quence.* 

The number of different maximum length 
series obtainable from an n-state shift-register 
0(2“ 1) 

is given by — ^ where 0 (m) is Euler’s 

Phi Function, defined as the number of integers 
s such that 0 < s < m and s is prime to m. For 
a three-stage generator n = 3, m = 2" — 1 = 7, 


and the integers s are 1, 2, 3, 4, 5, 6. Therefore, 

- ~ = 2, so a three stage shift-register 

can produce two different maximum length se- 
quences, each corresponding to a unique feed- 
back arrangement. Generators No. 1 and No. 2 
in Fig. 2 show these two feedback arrange- 
ments. Generator No. 3 in the same figure is 
an example of a feedback arrangement that pro- 
duces a non-maximum length sequence. Non- 
maximum length sequences do not in general 
satisfy the three conditions required for pseudo- 
random sequences, and thus they are not useful 
for statistical studies. Consequently, to design 
a practical pseudo-random noise generator of 
n-stages, it is necessary to determine the feed- 
back connections which produce a sequence of 
2" — 1 bits in length. 

Another important characteristic of maxi- 
mum length sequences is the so-called “shift 
and add” property. That is, when any maxi- 
mum length sequence is delayed by an integral 
number of clock periods and then added mod- 
ulo-2 with the original sequence, a third identi- 
cal sequence delayed with respect to the first 
two is formed.® This property has a useful 
application,- as will be pointed out in Section III 
where the actual hardware design of a binary 
pseudo-random noise generator is considered. 
The “shift and add” property is also used to de- 
rive the autocorrelation function for maximum 
length sequences (see Appendix). 

2.3 Obtaining the Maximal Period 

A mathematical technique for obtaining 
maximum length sequences is rigorously pre- 
sented in Refs. 3 and 5. The method consists of 
viewing each state of the n-stage shift-register 
as an n-dimensional vector and the shift- 
register/modulo-2 adder system as a linear 
operator, producing the successive states of the 
n-dimensional vector. Such an operation may 
be represented by an n x n matrix X. The first 
row of this matrix corresponds to the first stage 
of the register, the second row to the second 
stage, etc. The same is true for the n columns. 
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i.e., the first column represents the first stage, 
etc. Each element of the matrix is either a “1” 
or “0”. A “1” in any position indicates that the 
flip-flop stage denoted by the column drives the 
flip-flop stage denoted by the particular row. 
Otherwise, a “0” elements exists. Considering 
generator No. 1 in Fig. 2, the X matrix may be 
constructed by observing that stage 1 is fed by 
stages 2 and 3 ; therefore in the first row, a “1” 
is entered in columns two and three. Stage 2 
is fed by stage 1, so in row two a “1” is placed 
in column one. Again stage 3 is driven only by 
stage 2, hence in row three a “1” is placed in 
column two. This completes the X matrix for 
generator No. 1, viz.. 


X=: 


0 

1 


0 


1 1 ' 

0 0 

1 0 


Similarly, for generator No. 2 the defining 


matrix is 





' 1 

0 

1 ' 

X = 

1 

0 

0 


. 0 

1 

0 , 


It should be noted that the diagonal below the 
main diagonal will always consist of a series of 
“I’s”, while the first row of the matrix repre- 
sents the feedback coefficients. Generalizing, 
the X matrix for n stages may be written : 


Cl 

Cj • 

• ■ Ci ■ 

• • c„ 

1 

o • ■ 

• • o • 

• • o 

o 

1 . 

o • 

• • o 

• 

• 

1 . 

• 

o 

0 • • 

■ • o 

’l o 


where the C’s represent the feedback coefficients (“0” or “1”) . The characteristic polynomial of the 
X matrix is thus. 


det [X — xl] = 



Cj — \ 

Cj. 

. . . Cl • • • 

c. 


1 

— A 

o • • • 

0 


o 

1 

■ — A • • • 

0 


• 

• 

1 . ■ 

• 


o 

o 

• • • O ■ 1 

— A 

= ( — x)"- 

-MCl- 

-A)— C2(— a)»-2-1-C3(- 

_A)»-3 

= ( 

— A)“| 


Ci C 3 C 3 

A A* A® ■■■■ 

C.' 

= 

( — D" 


r -1 “* 



i = l 


where 6 = ■ 


The bracketed term 



= 0 is 


defined as the characteristic equation of the 
shift-register generator. A necessary, but not 
sufficient, condition for the period of the shift- 
register to be of maximum length is that the 


characteristic equation be irreducible (unfac- 
torable). In Fig. 2 both generators No. 1 and 
No. 2 have irreducible characteristic equations, 
but that for No. 3 can be factored. Using a tech- 
nique known as the “sieve method,” the irre- 
ducible polynomials which produce maximum 
length sequence have been tabulated, e.g., see 
Ref. 4. 
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2.4 The Autocorrelation Function 

One of the most important properties of a 
maximum length sequence generator output is 
its time autocorrelation function. Since the 
sequence is periodic every 2“ — 1 bits, the auto- 
correlation function will also be periodic every 
2" — 1 bits. Each bit represents one clock 
period. At seconds in duration. In general, the 
autocorrelation function for the first period of 
an n-stage shift-register can be shown to be 


Rxx(r) 


1 — 


2 “ 


2 ". 


for I T I ^ 1 bit 



for I T I > 1 bit 


Rxi(t), for T equal to an integral number of bit 
intervals, is derived in the Appendix. Figure 3 


x(f) 


1 

1 








-1 













Figure 3. 


shows the autocorrelation function of a maxi- 
mum length sequence x(t). It can be seen that 
for time delays equal to or greater than one 
clock period, the correlation is essentially zero 
for large values of n. 

As opposed to the one-peak-per-period auto- 
correlation function of maximum length se- 
quences, the autocorrelation function of non- 
maximum length sequences has several positive 
and negative peaks per period.’" This results 
because, as stated before, non-maximum length 
sequences do not satisfy the three conditions 
for randomness. 


2.5 The Power Density Spectrum 

From the Wiener—Khintchine theorem, it is 
known that the power density spectrum Gxx(f) 
of a function x(t) is simply the Fourier trans- 
form of the autocorrelation function for x(t). 
As derived in the Appendix, the power density 
spectrum for a maximum length pseudo-random 
binary sequence x(t) is 

00 

G..(t) = -L S(f) + Y. ¥ 

a — — 00 
0 

where Z 2" — 1 = number of bits in one period 
of x(t) 

f,. = 1/At = clock frequency 

From the above expression, it should be noted 
that 

(1) Gxx(f) is a discrete spectrum with a 

f 

harmonic separation of c.p.s. 

(2) Nulls occur at integral multiples of the 
clock frequency fe. 

(3) The bandwidth of Gjx(f)'^ 0.32 f,.. 

Therefore, for a 25-stage shift-register (Z = 
22* — 1) and a 4 Me. clock rate, Gxx(f) has a 
harmonic frequency separation of 0.123 c.p.s. 
and a bandwidth of 1.28 Me. 

III. DESIGN OF A PSEUDO-RANDOM 
NOISE GENERATOR 

3.1 Overall Design 

The noise generator employs the same plug-in 
digital module cards (Computer Control Co. 
S-Pacs) standard in the new ASTRAC II com- 
puter. The block diagram of the entire noise 
generator system is shown in Fig. 4. This sys- 
tem will operate at clock rates as high as 4 Me 
and as low as desired. Multiplexing yields 4 
uncorrelated pseudo-random noise outputs from 
a single 25-stage shift-register. The clamping 
circuitry produces ±6 volt analog outputs 
which are easily filtered to generate Gaussian 
signals; diode function generators can be used 
to produce other types of analog noise with 
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specified amplitude distribution and power 
spectra. 

3.2 The Shift-Register and Modulo-2 Adders 

The shift-register consists of 25 general pur- 
pose flip-flops. The SET output of each flip-flop 
is connected to the Reset Level Control of the 
next stage, and the RESET output is connected 
to the Set Level Control (Fig. 5). The shift- 
register is normally driven from the main 4 Me 
ASTRAC II Clock Oscillator. The maximum 
length period for n = 25 is 2*® — 1 = 33, 554, 
431 bits. Since there are usually many feedback 
arrangements which will produce a maximal 
sequence, the one requiring the least hardware 
will be selected. Using the table in Ref. 4, it was 
found that if the outputs of stages 3 and 25 are 
fed back, only one modulo-2 adder is needed. 
Using this same table, the feedback arrange- 
ments, for n = 10 through n = 33, requiring 
only one modulo-2 adder to produce a maximum 
length were calculated and are listed in Table 


4 UMMrralafetf 
NoiM OulfNrt* 


Cteeh 

Oeclllater 


SMft 

Puim 


•WirCM 



Clompinf 

Circuitry 



precition - 
HflMted 


bifMiry 

noit« 


Gaussian 
nois* sritti 

(PMifISd 

spoclrum 


AV- 


Function 

Gsnsrator 


noiss Mini 
sp«cifls4 
ompliluds 
dtstrlbutian 
% spactmtn 


Figure 4, ASTRAC II Noise Generator. 



"1 1 I I I 7 

NOISE 6ENERATOR RESET LINE 

Figure 6. Pseudo-Random Noise Generator Connections. 


1 of the Appendix. In terms of C.C.C. logic 
modules, a modulo-2 adder requires 4 NAND 
gates. The first 3 gates form an EXCLUSIVE — 
OR, and the fourth gate provides the comple- 
ment of the EXCLUSIVE— OR circuit (Fig. 
5). The output of the modulo-2 adder is fed 
back to the Reset Level Control of the first 
shift-register stage, and its complement is fed 
back to the Set Level Control. 

The output of each flip-flop and its comple- 
ment are wired to a small patchbay (with a 
removable patchboard) provided for the noise 
generator. Since the binary sequence for any 
stage of the shift-register is the same as that 
of the preceding stage delayed by one clock 
interval, the patchbay allows access to 25 se- 
quences — each delayed by one clock interval 
from the previous one. The DC SET and DC 
RESET input terminals of each flip-flop are also 
brought out to the patchbay so that any desired 
initial state may be patched into the shift- 
register from the computer NOISE GENERA- 
TOR RESET LINE at the start of each compu- 
tation. This resetting feature is of considerable 
interest for rechecking a series of Monte Carlo 
computations with different pseudo-random 
noise inputs. 

If the NOISE GENERATOR RESET patch- 
ing is omitted, the noise generator will run free 
and produce periodic sequences at a repetition 
rate not in general commensurable with the 
analog computer iteration rate. 

To provide the option of having two com- 
pletely independent noise generators, the con- 
nections between shift-register stages 11 and 
12 are patched rather than wired internally. 
The first 11 flip-flops require only one modulo-2 
adder to produce a maximum length sequence, 
but the 14 stage shift-register requires three 
adders to produce a maximal period. One 
arrangement used to obtain a maximum length 
sequence from the 14-stage generator is to add 
the outputs of flip-flops 3, 4, 5, and 14 modulo-2 
and to feed the sum back to stage 1. 

As one of many alternative patching combi- 
nations, two identical 11-stage maximum se- 
quence noise generators, employing only one 
modulo-2 adder each, may be constructed by 
using the same feedback connections for the 
14-stage shift-register as are used for the 11- 
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stage register (the outputs of flip-flops 2 and 
11 are modulo-2 added and fed back to flip-flop 
1). The sequence output of the 14-stage shift- 
register is then taken from the eleventh stage, 
leaving 3 stages unused. 

3.3 Multiplex Logic 

The purpose of the multiplex logic is to pro- 
vide several (in this case four) uncorrelated 
pseudo-random noise sources from a single 
shift-register. There are at least two ways of 
accomplishing this. The first method consists 
of sampling the shift-register output once every 
fourth clock interval. This method is used in 
the noise generator for ASTRAC II, because it 
requires less digital equipment. The second 
method utilizes the “shift and add” property of 
maximum length sequences. 

3.3.1 Method I — Output Sampling 

The output sampling technique uses four 
staggered 1 Me pulse trains to sample the 4 Me 
pseudo-random noise output. Each pulse train 
samples the shift-register noise output every 
fourth clock interval, e.g., the first sampling 
pulse train senses the state of the noise genera- 
tor output during the first, fifth, ninth, etc., 
clock intervals. The final result is four essen- 
tially uncorrelated 1 Me pseudo-random noise 
outputs, each of, which is a maximum length 
sequence (33, 554, 431 bits if the 25-stage shift- 
register is used) . 

Since the 4 Me noise sequence is sampled only 
once every fourth clock interval by any one of 
the 1 Me pulse trains, it takes 4 periods of the 
4 Me sequence to produce the 33, 554, 431 bit 
period of any one of the 1 Me sequences. The 
1 Me sequences do not become periodic after 
the first 4 Me period because the number of bits 
in the 4 Me period is not evenly divisible by 
four. In fact, by designating the four 1 Me 
outputs as A(t), B(t), C(t), and D(t) respec- 
tively, it can be shown (see Fig. 6) that during 
the second 4 Me shift-register period A (t) takes 
the same sequence B(t) had during the first 
period of the shift-register etc., until the fifth 
period. Then A(t) begins to repeat the sequence 
it had during the first shift-register cycle. A 
similar process holds for the other three se- 
quences B(t), C(t), and D(t). Actually, all four 


A I B I C I D 

First period of the shift-register 


D I A I B I C 
Second period of the shift-register 


C I D I A 1 B 
Third period of the shift-register 


B I C I D I A 
Fourth period of the shift-register 


I A I B I C I D I 
Fifth period of the shift- register 

Figure 6. 

outputs have the same binary sequence, but each 
of them is delayed from the previous one by 
approximately 8, 388, 608 bits. That is, for the 
outputs A(t), B(t), C(t) and D(t), 

A (t) = B (t -H 8, 388, 608 bits) 

A (t) = C (t + 16, 777, 216 bits) 

A (t) = D (t -j- 25, 165, 824 bits) 

A (t) = A (t -1- 33, 554, 431 bits) 

It follows that the cross-correlation function of 
A(t) and B(t) will peak only after 8, 388, 608 
bits. For the sampling rate of 1 Me, 1 bit is 
equivalent to 1 microsecond in time. Therefore, 
the correlation peak occurs approximately 8.4 
seconds away from the time origin. The same 
is true for B(t) and C(t), or C(t) and D(t). 
This is long enough for, say, 500 computer runs 
using 15,000 bits per run from four essentially 
uncorrelated noise generators. The digital cir- 
cuitry required to implement the above opera- 
tions is shown in the Appendix. 

Using the same approach, other combinations 
are possible. For example, the ASTRAC II 




294 


PROCEEDINGS-^PBING JOINT COMPUTER CONFERENCE, 1964 


pseudo-random noise generator is designed so 
two uncorrelated 2 Me maximum length se- 
quences may be generated. Denoting them E (t) 
and F(t), then E(t) = F(t -|- 16, 777, 216 
bits) . One digital bit in this case is equivalent 
to of a microsecond. This added feature re- 
quires little additional circuitry, as can be seen 
in the Appendix. 

As with the shift-register, the flip-flops of 
the multiplex logic may be reset as desired by 
patching between the flip-flop DC SET or DC 
RESET terminal and the computer NOISE 
GENERATOR RESET LINE. 

3.3.2 Method II — “Shift and Add” Property 

In effect, the sampling method described 
above merely produces four identical sequences 
delayed with respect to one another. In view 
of the “shift and add” property of maximum 
length sequences,® similar results can be 
achieved by suitable modulo-2 addition of the 
various flip-flop outputs. By selecting the ar- 
rangement which produces four sequences de- 
layed by approximately 8 million bits from one 
another, four essentially uncorrelated noise out- 
puts are obtained. The main external difference 
between this method and the sampling method 
is that here the noise sequences are still 4 Me, 
whereas before they were reduced to 1 Me. 

3.4 Digital Equipment Requirements 

Using commercially produced (Computer 
Control Co.) 5 Me plug-in cards, the entire 
digital system (25-stage shift-register, 3 
modulo-2 adders, and the multiplex logic) costs 
approximately $2,100. The following compo- 
nents are required : 

1. Eight MF-35 flip-flop cards (4 flip-flops 
per card) . 

2. Four DI-35 NAND gate cards (8 NAND 
gates per card) . 

3. One PN-35 power amplifier card (4 power 
amplifiers per card) . 

4. One mounting rack ( 19 card capacity) . 

5. Vector patchboard and frame (300 con- 
tacts). 

The design of the front panel and associated 
patchboard of the ASTRAC II Noise Generator 
is shown in Fig. 7. 


Removeable 

Potchboard 


ON 

(SP 


OFF 


Figure 7. Front View of Noise Generator Panel. 

3.5 Level-Shifting and Clamping Circuitry 

To obtain precision-limited binary sequences 
of ±6 volts, it is necessary to pass the digital- 
noise sequences through level shifting and out- 
put clamping circuits. The voltage level shift 
is required since the standard outputs of the 
C.C.C. digital modules are — 6 and 0 volts. 

Shown in Fig. 8 is the circuit used to accom- 
plish the level shifting and clamping operations. 
Transistors Qi and .Q2 form a Schmitt trigger. 
Qs provides a current source for the Schmitt 
trigger, and Q4 provides a clamped emitter fol- 
lower type output. The rise time is less than 50 
nanoseconds as is the delay through the circuit. 



Figure 8. Level-Shifting and Clamping Circuitry. 


3.6 Control of the Mean and Mean Square 
Value^'^ 

3.6.1 Mean Value 

At high digital-clock frequencies (10 Kc to 
4 Me) used in typical repetitive computer appli- 
cations (10 cps to 1 Kc repetition rates), the 
DC level E [x] of the noise generator is best 
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controlled by a DC blocking filter with the 
transfer function 

H(S) = (1) 

where T = RC (Fig. 9) is an appropriately long 
time constant. For “slow” analog computation, 
symmetrical precision clipping of the binary 
noise output is usually sufficient to control E 
[x] , or the active filter of Fig. 10 may be used 
to generate the transfer function (1). 



during any one computation, the peak-to-peak 
voltage will be monitored during computation 
by oscilloscope comparison with a DC reference 
voltage read on the computer digital voltmeter. 
At low clock frequencies, simple analog pre- 
cision limiters may be used to clip the binary 
waveform, with an accuracy of approximately 
0.1 per cent. Still, at high clock frequencies 
improved transistor clamping circuits, as op- 
posed to more cumbersome AGC-type control, 
would be useful in regulating the mean square 
value. 

IV. EXPERIMENTAL TESTS 

The following experiments are now being 
conducted with the aid of the ASTRAC I repeti- 
tive statistics computer. 


Figure 9. Passive DC Blocking Filter. 



Figure 10. Active DC Blocking Filter. 

3.6.2 Mean Square Value 

Finite time averages of the squared binary 
noise output x^ are not subject to fast random 
variations, and equal one-fourth of the squared 
peak-to-peak output when a DC blocking filter is 
used. However, even with a constant load 
(operational-amplifier input), the, peak-to-peak 
output of the clamped emitter follower in Fig. 
8 is subject to slow drift due to temperature 
variations of the break points of the clamping 
diodes. For example, a 10° F temperature 
variation could change the peak-to-peak output 
as much as 30 millivolts, or 0.25 per cent of the 
nominal 12 volt value. This corresponds to a 
0.5 per cent change in the mean square output 
E [x^] . Although drift should be negligible 


4.1 Time Average and Mean Sqvnre Value 

The time average of the pseudo-random noise 
may be measured by filtering it with a low pass 
filter. The filter output is then sampled and 
converted into the appropriate number for a 
digital counter. The direction of the count is 
determined by the polarity of the sample. When 
the preset run counter stops, say, at m computer 
runs, the number indicated in the counter will 
be 

m 

^ Xk m X 
k=:l 

Alternately, the time average may be found by 
using an analog integrator with a long time 
constant. 

By using a diode squaring circuit and the 
same “sample and count” method used to find 
the time average, the mean square value is ob- 
tained. After m runs the counter will read* 

m 

X X = m F 2 
k = 1 

4.2 Amplitude Distribution 

This measurement is made by using the 
Amplitude - distribution Analyzer built into 
ASTRAC I. The analyzer uses a slicer-circuit 
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which enables its output counter to register one 
pulse per computer run if and only if the ana- 
lyzer input voltage x(t) lies between the preset 

values (x — and . The result- 

ing count estimates the Probability that 
j^X— ^<x (ti) ^ X-h 

If AX is made sufficiently small, an approxima- 
tion of the density function is obtained.^ 

4.3 Correlation Experiments 

A unique method for measuring the autocor- 
relation function of pseudo-random noise con- 
sists of wiring two pseudo-random generators 
to produce identical maximum length sequences. 
One of the generators is then driven at a clock 
frequency of fi and the other at a clock fre- 
quency of fi -j- Af, where Af is very small com- 
pared to fi; thus the noise generator outputs 
appear with a continuously varying delay with 
respect to one another. The generator outputs 
are then multiplied together and low-pass fil- 
tered to give the approximate autocorrelation 
function for direct oscilloscope or recorder 
presentation.*® 

A more accurate estimation of the autocor- 
relation function may be obtained by using the 
ASTRAC I computer to generate the following 
functions®: ''x(ti) + '‘x(ti -j- t) and '‘x(ti) — 
‘xCti + t), k is the kth computer run and the 
samples at ti and ti -f r are taken during each 
computer run. The two functions are then 
square to give 

['‘x(ti) + '‘x(t, + t)] 2 and 
[Kx(t,)— ■‘x(t, + r )]2 

After m runs, the final result stored in the 
counters is 



m/2 

= ^ '‘x(ti) '‘x(ti -I- t) 

k = l 


For a large number of runs, 

. , m/2 

estimated R„ (t) = — ^ '‘x(ti) '‘x(ti -|- t) 

in 

k = 1 

4.4 Probability Distributions of the Pseudo- 
Random Sequence 

Assuming the sequence period is large, bi- 
nomially distributed random numbers q are 
obtained by counting the noise generator “I’s” 
occurring during Q clock periods. 



E {q} = Q/2 

Var {q} = Q/4 

Note that, the random variable q is asymptoti- 
cally normal as Q -> te. Thus, the random 
variable u, where 

_ q — Q/2 _ 2q — Q 
" - Q/4 - Q 

converges in probability to a standardized nor- 
mal random variable. 

V. CONCLUSION 

In conclusion, it should be pointed out that 
such a hybrid noise generator has several strik- 
ing features not generally shared by existing 
analog noise generators : 

“1. The analog noise output is independent of 
any physical quantity other than the out- 
put clamping levels. 

2. The shift-register can be reset at any time 
to repeat a sequence of random events 
exactly. 

3. The binary noise bandwidth is propor- 
tional to the shift register clock rate, 
which can be changed (and modulated) 
at will. This permits exact time scale 
changes or intermittent operation. 

4. The noise generator produces digital com- 
puter random numbers as well as analog 
noise. Random digital words are easily 
stored and transmitted and also may be 
used to produce binomially distributed 
random analog coefficients with the aid of 
D/A-converter multipliers. 
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5. Digital multiplexing yields multiple un- 
correlated noise signals from a single 
shift-register. 

6. The binary output permits direct logical 
or analog multiplication. In addition, the 
group-alphabet property of shift-register 
sequences yields delayed sequences, with 
the aid of digital logic; which is useful 
for correlation experiments.” ^ 

7. The flat spectrum binary noise yields 
Gaussian noise with any reasonable spec- 
trum by filtering; diode function gener- 
ators may then be used to produce differ- 
ent amplitude distributions. 

8. One noise generator may also be broken 
into two or more completely independent 
generators if desired. 

In view of the versatility of pseudo-random 
noise generators and the reliability and decreas- 
ing prices of digital-logic modules, the conven- 
tional analog noise generator may well become 
obsolete. 
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APPENDIX 
Multiplex Circuitry 

To obtain the four staggered 1 Me sampling 
signals, a Gray-Code logic circuit driven by the 
4 Me ASTRAC II clock is used (Fig. A) . The 
Gray-Code effectively divides the clock input by 
four. The required sampling pulses exist at the 
outputs of the four NAND gates. They are 
denoted by ®, (D, (3), and @ (Fig. B). The 
multiplexing is accomplished by inverting one 
of the sampling pulses, for example ®, and 
applying it to the AC input of a flip-flop. The 
inversion is necessary for correct triggering. 
The 4 Me pseudo-random noise sequence and its 
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NOISE GENERATOR RESET LINE 
1 1 “ 



Derivation of the Autocorrelation Function 
(t) of Maximum Length Sequences^^ 

(For T equal to an integral number of bits or 
clock periods) 

Previously, the pseudo-random noise from a 
shift-register has been considered to be a se- 

. TIME— » 

.:iJirLJirLriJirLrLrL 

o| 

A 

-el 


. 0 
A 

-6 



Figrure A. Gray-Code Circuit. 

complement are applied to the flip-flop Set and 
Reset Level Controls as shown in Fig. C. The 
flip-flop output is then the desired 1 Me pseudo- 
random noise. The other three 1 Me noise 
sources are similarly derived. 

Two staggered 2 Me sampling pulses are 
easily obtained by NAND-gating pulse trains 
(T) and @ to give the first 2 Me sampling signal 
and NAND-gating @ and ® to give the second. 
They are then applied to the AC input of a flip- 
flop, as was done with the 1 Me sampling pulses, 
to obtain two uncorrelated 2 Me maximum 
length sequences. 

A Gray-Code circuit is used instead of two 
cascaded flip-flops to provide the “divide by 
four” operation, because only one of the flip- 
flops shown in Fig. B changes states per clock 
pulse. On the other hand, with two cascaded 
flip-flops, both go through a transition of states 
every second clock pulse. This would create un- 
desirable coincidence spikes when the cascaded 
flip-flop outputs were NAND-gated to obtain 
the four staggered 1 Me sampling trains. 



Fi^re B. Four Staggered 1 Me Sampling Pulses. 


NOISE GENERATOR RESET LINE 



I Me. 
Ptutdo- 
Random 
Nolto 


Y=4Mc. Pseudo ~ Rondom Noise 
Ys Complement of 4 Me. Pseudo — Rondom Noise 


Figure C. 
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quence of “O’s” and “I’s”, representing the two 
stable states of the flip-flop stages. If instead, 
the two stable flip-flop states are designated as 
“1” and “ — 1” respectively, it can be shown that 
modulo-2 addition with “0” and “1” is equiva- 
lent to arithmetic multiplication with “1” and 
« 


A 

B 

A.B 

—1 

1 

—1 

1 

—1 

—1 

1 

1 

1 

—1 

—1 

1 


A 

B 

(A-fB)MOD 2 

1 

0 

1 

0 

1 

1 

0 

0 

0 

1 

1 

0 


Thus, the “shift and add” property for “1” and 
“0” sequences becomes the “shift and multiply” 
property for “ — 1” and “1” sequences. Since 
for maximum length sequences of “I’s” and 
“O’s”, it is true that 

X(t) -fX(t-l-T,) =X(t-f-T2), 


Since 1 bit is equivalent to 1 clock interval in 
time, the period T = 2" — 1 bits. Now substi- 
tuting equation (1) into equation (3) and ex- 
pressing the limits in terms of n, we have 


2 ”. 


Rii(ti) = 2 

RixCn) = 

1 


2" 


times 


x(t -f- T 2 ) dt 


-11 



(5) 

Number of “1 

"s” in 

the sequence 

minus 

the number of 


in the sequence 



Since the all “1” state (or equivalently, the all 
“0” state in terms of modulo-2 addition) is a 
forbidden condition, a maximum length se- 
quence always has one more “ — 1” state than it 
does “1” state. Consequently, 


Rj:x(ti) — 2n ^ 1 Tl=?^0 (6) 


where ti and t 2 are equal to some integral num- 
ber of clock periods. It follows that for maxi- 
mum length sequences of “ — I’s” and “I’s”, 

x(t) X(t-I-Tl) =X(t-l-T 2 ). (1) 

The time autocorrelation function for a func- 
tion x(t) is by definition 

T/2 

Rxx (t,) = lim ^ J xCt) x(t -h r,) dt. (2) 

T“> 00 

—T/2 

When X (t) is periodic, as is the case for a maxi- 
mum length shift-register sequence, no limiting 
process is needed. Therefore, the autocorrela- 
tion equation becomes 

T/2 

Rxx(ti) = Y / x(t) x(t-|-r,) dt. (3) 
—T/2 


If Ti = 0, the autocorrelation function may be 
written, 

2 " — 1 
2 

R«(0). = -2 ^1 / 

— ( 2 - — 1 ) 

2 

2 " — 1 
2 


— ( 2 " — 1 ) 
2 

= 1 


Derivation of the Power Density Spectrum 
Gj.j (f) of Maximum Length Sequences 


00 

G„(f) = F (R„(t)) = j R.,(r)e-2-*" dr 


00 
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using a Fourier series representation for Rkx(t) 

« T/2 

Rxx(t) = ^ rai“ 2’^ ,ra = ^ j R(T)e-^» dr, fo = ^ 

o= — 00 — T/2 

OO 00 

Gxx(f) = J Y, 

00 a = 00 

since the Fourier series for Rxx(t) converges uniformly, the order of integration and summation 
can be reversed, giving 

00 00 

Gxx(f) =^ra 


00 

= ^ ra S (f — afo) 

o — 00 


T/2 

— f R(r) e-^»2xf.r drj S(f — afo) 
—T/2 
T/2 =0 

dr ^ 8(f — afo) 

—T/2 a = —00 

00 

= -^GT(f) Y — 

a=: — 00 


= I [ 

a. — — QO 


T/2 

where GiCf ) = j Rxi(’’) e dr is the Fourier transform of Rxi(r) defined over one period. 

—T/2 


Rxx(r) =i 



I-M 


(Z+1) 

T 


T/Z < |t| < T/2 
lr| < T/2 


where Z = 2" — 1 

n = number of shift-register stages. 
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Evaluating the integral for GxCf ), 

r m - rz-4-ii 

GxCf) - 22 (Z+1) 2wi T/2Z 


T 

"sin 2irf T/21 

Z 

2,rfT/2 J 


. G m-± y (Z+1) pin2.afoT/2Z -| * _ T pin 2. a fp T/2 -]\ 

. ^ ^22 2,raf,T/2Z J Z^ 2«-afoT/2 


a=: <50 


I (^[^] 




o = — 00 
0 


where f* = clock frequency = Zf o. 


n 

Add Modulo-2 the 
Outputs of Stages 

10 

3,10 

11 

2,11 

15 

1,15 

17 

3,17 

18 

7,18 

20 

3,20 

21 

2,21 

22 

1,22 

23 

5,23 

25 

3,25 

28 

3,28 

31 

3,31 

33 

13,33 


(n equals the number of stages) 


Those values of n between 10 and 83 not appear- 
ing above can not produce a maximum length 
sequence with one modulo-2 adder. 

FEEDBACK CONNECTIONS REQUIRING 
ONLY ONE MODULO-2 ADDER TO PRO- 
DUCE A MAXIMUM LENGTH SEQUENCE : 
n = 10 through n = 33 

TABLE 1 


afo) 
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INTRODUCTION 

The advent of fast iterative analog com- 
puters^’2’3 poses the requirement for storing 
analog functions for integral multiples of the 
computer repetition period which is typically 
of the order of 1 to 10 milliseconds. In addition 
to function storage for one computer run, it 
would be desirable to store analog functions for 
indefinite time intervals to permit table lookup. 
Typical accuracy requirements might be 0.1 to 
0.5 per cent of half -scale at ten times the com- 
puter repetition frequency (10 cps to 1000 cps) 
with somewhat lower accuracy at higher fre- 
quencies.^ 

For most iterative-differential-analyzer ap- 
plications, existing function storage schemes, 
such as capacitor wheels, reed-relay matrices, 
cathode-ray storage tubes, storage integra- 
tors, lumped-parameter operational amplifier 
schemes, and magnetic recording techniques 
all distinguish themselves by varying degrees of 
impracticality, high cost and low accuracy. 

Magnetostrictive (acoustic) delay lines have 
been used successfully in digital computers.'^ 
These lines are highly developed and widely 
available in packages which include all write 
and read circuitry. Bit rates up to 5 Me with 


several milliseconds of delay are possible. Pulse 
regeneration with clock-gated logic permits cas- 
cading of delay line sections and indefinite re- 
circulation of digital data. 

The magnetostrictive delay line is for all prac- 
tical purposes designed for digital pulsed sig- 
nals. Consequently, in order to use this type of 
line it is necessary to encode the signal into a 
pulse modulated form. Pulse amplitude modu- 
lation is unsuitable because attenuation and 
reflection in the line result in poor signal to 
noise ratios. Since pulse frequency modulation 
and pulse position modulation require precise 
timing circuits to effect pulse regeneration, it 
would seem practically impossible to use these 
schemes for indefinite recirculation of analog 
stored data. 

True pulse code modulation (PCM) is with- 
out question the most efficient modulation 
scheme. PCM will, however, require a complete 
set of A to D and D to A converters to achieve 
modulation and demodulation. This method is 
used in the Electronics Associates HYDAC com- 
puter^ but true PCM seems prohibitively ex- 
pensive for all but the largest computer instal- 
lations. 

Delta modulation,® i.e., pulse modulation 
through suppression of pulses in a constant 
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frequency clock pulse train, appears especially 
attractive, for it combines inexpensive modula- 
tion and demodulation with the possibility of 
pulse regeneration with clock gated logic just 
like true PCM. Conventional delta modulation 
which transmits the time derivative of an 
analog input is suitable for voice communica- 
tions, but not for transmission of absolute d-c 
levels. 

Inose et. al.^® circumvented this drawback to 
delta modulation by modifying the transmitter 
so that the pulse train represents information 
about the signal rather than its derivative. This 
modified delta modulation or delta sigma modu- 
lation meets the requirements for an analog 
signal encoder which can handle signals extend- 
ing down to zero frequency. The modification 
has not altered the intrinsic simplicity of the 
delta modulation system making it very attrac- 
tive for medium size computer installations. 

CIRCUIT OPERATION 

Referring to the block diagram of Fig. 1, the 
input signal is summed into an integrator along 
with the flip-flop output signal, -F(t). The 
integrator output sets a two-level comparator 
which then determines whether a gating circuit 
will allow a set or reset pulse to pass. If the 
integrated sum of the input and output signal is 
less than zero at the “sampling’' time a reset 
pulse is gated out. If the amplitude of the inte- 
grated sum is greater than zero a set- pulse is 
released to the flip-flop. 

The waveforms which appear in the system 
for a zero input signal are shown in Fig. 2. An 



Figure 1. Block Diagram of Analog Delay Unit. 



Figure 2. Modulator Waveforms for Zero Input Signal. 

examination of these waveforms will clarify the 
behavior of the system. If the transmitter is 
in a non-saturated condition, the feedback loop 
will attempt to keep the integrator output close 
to zero. Under these assumed conditions the 
integrated difference between the signal and the 
flip-flop waveform will be less than some arbi- 
trary number e, or 

t 

I (s(t) — F(t))dt^e(t) (1) 

o 

If one considers the quantity e to be small then 
the flip-flop output F (t) is approximately equal 
to S(t) in an average value sense. Therefore, 
to recover the modulating signal it is only nec- 
essary to insert the flip-flop output into a low 
pass filter; the output of the Schmitt trigger 
may also be used for this purpose. As used for 
computer memory, gated clock pulses taken 
from one of the feedback lines to the flip-flop 
are sent down the delay line and used to operate 
another flip-flop in the receiver rather than at- 
tempting to transmit and reshape the wider 
pulses occurring in the flip-flop’s output. 

Five megacycle Computer Control Corp. 
(CCC) transistor logic cards were used for the 
digital logic along with Burr-Brown ± 10 v 
operational Type 1607A amplifiers. The demod- 
ulator is a passive RLC filter of the Tchebysheff 
II t 3 q)e“ designed for 50 kc cutoff and preceded 
by a wideband operational amp. (10 me 0-db 
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Figure 3. Demodulating Filter Response. 



Figure 4. System Response of Complete AS System. 



INPUT VOLTAGE 


bandwidth) . Filter cutoff at 50 kc seems to be 
about optimum for a 2 me bit rate ; values above 
this admit excessive noise into the output. The 
demodulator has objectional phase shift above 
1 kc (see Fig. 3 for filter response, which was 
improved, at the expense of some amplitude 
response, by lead networks at the input. Fig 6.) 
The overall system response is shown in Fig. 4. 
The uncompensated system is flat out to 40 kc. 
Fig. 5 shows the static input output character- 
istic. This curve was obtained with a digital 
voltmeter and is seen to be quite linear. 

The output pulse widths of the CCC Delay 
Multivibrator Cards were narrowed to about 
half their normal minimum value. It was found 
necessary to add several additional logic cards 
to what could be considered a minimum compo- 
nent delta sigma system in order to prevent 
cross coupling effects from causing pulse-train 
jitter. Such jitter makes pulse reshaping more 
difficult, but has no noticeable effect on other 
aspects of system performance. 

Operation of the modular circuit, shown in 
Fig. 6, is straightforward. The 2 me clock trig- 


Dl -35 



Figure 6. A 2 Modulator Demodulator. 


Figure 5. Static System Response. 
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gers a delay multivibrator, whose complemen- 
tary output pulses are NAND gated so that one 
or the other is allowed to reach flip-flop FFl. 
The flip-flop output is added to the analog input, 
and the sum is integrated. The integrated out- 
put is fed into a Schmitt trigger which in turn 
controls the NAND gates in such a manner that 
feedback of the flip-flop output always tends to 
return the integrator output to zero. As a re- 
sult, the running average of the output pulse 
F (t) is proportional to the analog input to the 
integrator. 

SYSTEM RESPONSE 

Although it is difficult to determine the modu- 
lator response or pulse code pattern for an arbi- 
trary input, it is instructive to consider the sys- 
tem response to selected inputs. 

As Fig. 2 indicates, the flip-flop output for a 
zero input signal consists of a square wave with 
the transitions occurring at the clock intervals. 
Consider an input step voltage equal to one half 
the flip-flop voltage. The flip-flop pulse pattern 
which would result from this input is displayed 
in Fig. 7 along with various other waveforms 
in the system. The average value of the flip- 
flop voltage F(t) is Eo/2, one half its peak 
value where E,, is the amplitude of the flip-flop 
output. The fundamental frequency of the 
waveform for this input is 1/4T where T is the 
clock period. If this waveform is averaged by a 
low pass filter, the steady state output of the 
filter would be E/2 volts or the input signal. 


Input S (t) 



F(t) 















Figure 7. Modulator Waveforms for Half Full 
Scale Input. 


0 fc Frequency — > 

Figure 8. Ideal Low-pass Filter. 

If the system is to be responsive to step volt- 
ages in a reasonable amount of time, it is nec- 
essary to keep the bandpass of the averaging 
filter as large as possible. On the other hand, 
too large a bandpass would permit an unaccept- 
able amount of ripple to appear at the output. 
If one assumes the existence of an ideal low 
pass filter as shown in Fig. 8 and if the cutoff 
frequency f^. is set below 1/4T cycles per sec- 
ond, no ripple components will appear in the 
steady state output. Unfortunately, the ripple 
frequency is a function of the input level and 
for inputs close to ± Eo volts, the fundamental 
frequency of the flip-flop waveform becomes 
arbitrarily small. In fact, for input levels equal 
to ± Eo the system reaches the limits of its 
dynamic range and the flip-flop remains perma- 
nently in one state. For an averaging filter with 
a bandpass fc cycles per second the static range 
of the input must be limited to^® 

(fcT) Eo <E,„ <(l— f,T)Eo. (2) 

In a dynamic situation, the input signal may 
have a maximum amplitude larger than Eo 
volts. The stable pulse pattern responsible for 
Equation 2 does not result unless the level re- 
mains constant for periods of time commensu- 
rate with the averaging bandwidth, thus this 
dynamic range constraint does not hold for time 
varying signals. 

Triangle and square waveform response of 
this system were checked from .01 to 1200 cps. 
Amplitude response is linear and once the input 
and output are superimposed on a dual trace 
scope they remain superimposed as the input 
amplitude is varied throughout the system 
range. Waveform response for a 2 me bit rate 
is shown in Figs. 10a through f. Input com- 
pensation was removed since it is not correct 
when a variable bandpass filter is used. 

Triangle waveform response at 500 cycles 
along with the modulating signal is shown in 
Figs. 9a and b for two values of demodulating 
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Figure 9. Waveform Response of A 2 System. 


filter cutoff. Fig. 9a shows a wide filter setting 
of 110 kc in order to show quantizing effects of 
the system. Figure 9b shows a more normal 
filter setting of 20 kc which filters out much of 
the noise and effectively recovers the modulat- 
ing signal. Square waveform response at 500 
cycles is shown in Figs. 9e and d for two values 
of demodulating filter cutoff frequency. Figure 
9c is for a 80 kc filter and Fig. 9d for a 20 kc 
filter. The only effect of filter width on a square 
waveform is in the height of the grass on the 
horizontal segments. The extra height hides 
the overshoot in the wide filter. Note that there 
is no measurable difference in the rise time of 
the input and output and that the signal re- 
sponse is only dependent upon the characteris- 
tics of the demodulating filter. 

Sine wave response was checked from .01 cycles 
up to 600 kc.i^ While the system will still re- 
spond at these high frequencies, noise content 
becomes objectionable above 100 kc and sine 
wave response was not considered useful above 


this frequency. At frequencies below 50 kc 
amplitude response is excellent with the same 
general properties commented on for triangle 
and square waves at lower frequencies. Sine 
wave response is shown in Figs. 9e and f for 
a 50 kc filter and a 500 cycle and 20 kc sine wave 
respectively. In Fig. 9e the input is the lower 
amplitude wave while in Fig. 9f it is the wave 
occurring first in phase. At 20 kc the demodu- 
lating filter phase shift is quite noticeable. As 
predicted in the S/N ratio section there is less 
noise associated with the high frequency wave- 
form which is close to the filter cutoff frequency. 

ADAPTIVE FILTERING 

It is apparent that the requirements for good 
noise suppression and high frequency response 
do not coincide. However, it is possible to vary 
the bandwidth of the low pass filter, leaving it 
wide for rapidly varying signals and decreas- 
ing it for low bandwidth signals. 

Since the demodulator or low pass filter is 
on the receiving end of the delay line, the shape 
of the signal is known five milliseconds before 
the pulse code reaches the filter. It is, therefore, 
possible to examine the signal at the input of 
the modulator and insert either a wide or nar- 
row bandwidth demodulator depending on the 
nature of the input. The rate sensing circuit 
(Fig. 10) consists of an R-C differentiator feed- 
ing an absolute value circuit which sets a two 
level comparator. The demodulator then acts 
as an adaptive filter whose bandwidth is varied 
commensurate with the requirements of the in- 
put signal. 


DYNAMIC RANGE 

The system exhibits little, if any, threshold 
effect on input level below which the system 
does not operate. Measurable amounts of thres- 



Figure 10. Block Diagram for Demodulating Filter. 
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hold can be introduced if the integrator capaci- 
tor is bypassed with a resistor; the amount of 
the threshold increases as the integrator time 
constant is decreased. Decreasing inputs in the 
unbypassed system eventually becomes lost in 
the system output noise at about — 40 db below 
saturation. This figure can be improved by suf- 
ficiently narrowing the bandpass filter. With a 
narrow filter, 200 cps, inputs ranging from 8v 
down to 5 mv or over a 64 db dynamic range 
can be detected. The dc level adjustment on the 
integrator input was always set to center the 
input in the dynamic range, i.e., saturation 
occurs at the same level for positive and nega- 
tive excursions. This is the same dc level which 
gives an alternating on-off pulse pattern at the 
output for zero input, as shown in Fig. 2. 


OUTPUT SIGNAL TO NOISE RATIO 


Delta sigma modulation is a pulse code modu- 
lation system and like true PCM systems, is sub- 
ject to quantizing noise. Fig. 9a is a photograph 
showing the output signal for a triangle wave 
input. The quantizing phenomenon is quite ap- 
parent from the photograph. Inose,*® using a 
somewhat simplified model, derives an expres- 
sion for the signal to quantizing noise ratio as : 




(3) 


where M is the ratio of the signal amplitude to 
the dynamic range, fr the clock frequency and L 
the cutoff frequency of the low pass filter. 


There is one significant respect in which the 
quantizing phenomenon of a delta sigma sys- 
tem differs considerably from that of a true 



Figure 11. Equipment Setup for Measurement of 
Signal to Noise Ratio and Spectrum. 


PCM system. If the input signal is a constant 
d"C level, the modulator feedback will adjust the 
pulse rate so that the average value of the flip- 
flop waveform F(t) equals the input voltage. 
Hence, no measurable quantizing errors are 
encountered in the transmission of d-c levels. 
On the other hand a conventional PCM system 
initially quantizes the input signal, and the 
maximum error which may occur at this point 
is a/2 where a denotes the voltage of a quantiz- 
ing interval. 

Signal to noise ratio was measured as a func- 
tion of bit rate, input amplitude, filter cutofT 
frequency, modulating frequency and integra- 
tor time constant. The results are plotted in 
Fig. 12 against the theoretical Equation 3 as 
shown in Fig. 11a. The normal fixed low-pass de- 



Figure 12a. A2 S/N Ratio Using Four 
Modulating Frequencies. 


modulating filter was replaced by an adjustable 
bandpass filter for use in the spectrum analysis. 
This was an active Kron Hite which has a cut- 
off of 24 db per octave and was modified to have 
a fixed low frequency cutoff of about 5 cycles. 
The HP Distortion Analyzer contains an 80 db 
rejection filter. The filter output was read on a 
true rms Ballantine voltmeter for accurate 
power evaluation. The measurement technique 
was to adjust the distortion analyzer to a con- 
stant output level on the Ballantine, tune out 
the fundamental, measure the new output and 
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Figure 12b. A2 S/N Ratio as a Function of Bit Rate. 



Figure 12c. A2 S/N Ratio Using Four Filter 
Cutoff Frequencies. 

calculate the S/N ratio. The distortion analyzer 
has a bandwidth of 80 kc which had negligible 
effect on the S/N ratio of test signals. 

The exact value of M for this system is diffi- 
cult f o determine since the modulator saturates 
slowly and some judgment must be used in 
evaluating the maximum amplitude. Also most 
of the distortion resulting from mild saturation 
levels can be removed by sufficiently narrowing 
the demodulating filter. Saturation for these 
measurements was determined as follows. De^ 
modulating filter cutoff was set to one tenth the 
bit rate and its output observed on a scope. The 


input level was reduced from obvious satura- 
tion until a continuous grass just formed across 
the top of the demodulated output. Using this 
method, M was found to be independent of bit 
rate, input frequency and of course filter cutoff. 
Below a minimum value of integrator time 
constant, about 3T or 150 nsec (Fig. 12a) the 
saturation level decreases as T decreases, other- 
wise M is also independent of T; this arae in 
addition shows a poor S/N ratio. All other tests 
were performed at a time constant of 40T. 

It is noted that in most instances the meas- 
ured value of the S/N ratio is higher than the 



Figure 12d. A2 S/N Ratio Using Three 
Modulating Frequencies. 



Figure 12e. A2 S/N Ratio Using Four Bit Rates. 
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theoretical prediction of Equation 3. For Fig. 
12b the values chosen were such as to be pro- 
portional to a similar plot given in Reference 
10 for which the experimental curve has the 
same shape, but is a few db below theoretical 
value. 

It is noted in Fig. 12c that the S/ N ratio, as 
predicted by Equation 3, is independent of mod- 
ulating frequency except at the high end of the 
modulating range. In all cases the S/N ratio 
increases as the modulating frequency ap- 
proaches within four octaves of the filter cut- 
off frequency. This is not predicted by Equa- 
tion 3 but is anticipated from spectrum studies. 
Since the modulating frequency is always the 
lowest spectrum line more harmonics will be 
removed from the total noise content of the out- 
put as the fundamental frequency approaches 
the filter cutoff frequency. For a perfect filter 
only the fundamental and noise would be left 
when the fundamental is within one octave of 
the cutoff frequency. While the noise power is 
present across the entire filter bandpass, its 
level is very low for high bit rates (see Figs. 
13d, e, f). 

S/N ratio as a function of M is shown in Fig. 
12d. It is noted that the theoretical and experi- 
mental curves for low M follow quite closely 
over a wide range of modulating frequencies, 
falling off only above .6M. This result is 
derived theoretically by consideration of a 
simplified modulating system so constructed 
that it may be analyzed mathematically. Simpli- 
fication is achieved by the elimination of the flip- 
flop from the feedback loop of the delta sigma 
system and removing one of the feedback lines. 
This results in the gated clock pulses being fed 
directly to the integrator input. The output of 
the integrator then becomes a series of rectan- 
gular pulses which can be expressed as a Fourier 
transform and multiplied by the demodulating 
filter frequency response function. Theoretical 
results obtained from this simplified system 
parallel quite closely the experimental results 
obtained from the Delta-Sigma system. 

Figure 12e shows the effect of varying the 
filter cutoff. While the theoretical and experi- 
mental curves tend to compare favorably at high 
cutolf frequencies, the S/N ratio falls off in- 
stead of continuing upward with continued de- 


crease of the filter cutoff frequency as predicted 
by Equation 3. Figure 12c also illustrates this 
point for the 2 me bit rate. 

PULSE-TRAIN SPECTRA AND 
FREQUENCY RESPONSE 

Spectrum studies were conducted on the flip- 
flop output. The test setup is shown in Fig. 11b. 
It was found necessary to impose a filter whose 
bandpass was slightly wider than the analyzer 
in order to prevent mixing of the spectrum at 
the analyzer input. The test procedure was 
simply to select a bit rate and sinusoidal modu- 
lating frequency and to measure the amplitude 
and frequency of the spectrum components at 
the filter output. The ratio of input voltage to 
maximum input, M, as given in Equation 3 was 
maintained at about 0.8. 

Figures 13a to 1 shows pulse-train spectra for 
sinusoidal inputs plotted with frequency in kc 
and amplitude in mv. Besides the input fre- 
quency and some of its harmonics, other fre- 
quency components can be seen. The first few 
harmonics of the input frequency are often 
missing. As an example, with a 50 kc bit rate 
and a 20 kc filter cutoff the spectra for 1, 2, and 
4 kc inputs are shown in Figs. 13a, b, and c, 
respectively; spectra are plotted up to 16 kc, 
the analyzer limit. Note that the second har- 
monic is always missing. In Fig. 13a, most of 
the higher harmonics reappear, starting with 
the eighth. In Fig. 13b all harmonics above the 
first are suppressed with the appearance of a 
new frequency component halfway between 
each harmonic location. In Fig. 13c, the higher 
harmonics are present along with a new fre- 
quency component which is one quarter of the 
way between harmonics. A continuous noise 
spectrum is present for all but zero input; the 
noise level is a function of both modulating 
level and input frequency. This is assumed to 
be the result of intermodulation around the 
loop since there is no measurable output noise 
for zero input. The amplitude of the fundamen- 
tal was always exactly proportional to the input 
level, but the other spectrum lines increase and 
decrease as the input level is increased ; a typi- 
cal result is shown in Fig. 14 for the spec- 
trum in Fig. 13b. Fig. 14a shows the amplitude 
of the fundamental frequency of 2 kc, which is 
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seen to be quite linear with input level, plus the 
next three spectrum lines resulting from the 
50 kc clock. Fig. 14b shows the amplitude of 
the final two spectrum lines obtainable with the 
test setup and the noise level as a function of 
input voltage. 




Figure 13. Spectrum Study of Flip-flop Output. 


It was noted that spectra containing exclu- 
sively harmonics of the fundamental occur only 
when the bit rate is an integral multiple of the 
modulating frequency. The converse of this is 
not always true, however, as can be seen in 
Fig. 13b. 

Spectrum analysis was also performed for the 
2 me bit rate and 20 kc filter cutoff up to 16 kc. 
Under these conditions only harmonic frequen- 
cies of the fundamental were present. Typical 
spectra are shown in Figs. 13d, e, and f. For 
bit rates several orders of magnitude higher 
than the modulating frequency, the question of 
even division depends on the fourth and higher 
digits in the clock frequency, and these are sub- 
ject to random drift. Under these conditions 
only harmonics of the fundamental were ob- 
served. These amplitudes were much less than 
that of the fundamental. 

Spectrums for low bit-rates are shown in 
Figs. 13g through 1. This allows the spectrum 
analyzer to cover more of the spectrum and 
gives a better insight into system operation. 
Figs. 13g, h, and i are for a 16 kc bit-rate in 
which the entire spectrum may be studied. In 
Fig. 13g there is no input and the main compo- 
nent is at one half the clock frequency as has 
been previously noted in Fig. 2. There is only 
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INPyi VOLTAGE RATIO 

Figure 14a, Spectral Line Amplitudes Versus M. 



Figure 14b. 


a small clock component and no measurable 
noise (note logarithmic millivolt scale). In Figs. 
ISh and i the fundamental frequency evenly di- 
vides the clock frequency and all signal har- 
monic frequencies are present. Note that the 
amplitude of some harmonics are greater than 
the fundamental. 

A bit-rate of 32 kc is used for Figs. 13j, k, 
and 1 allowing half of the spectrum to be ob- 
served. In this case nonfractional modulating 
frequencies are shown. Fig. 13j has many of 
the harmonic frequencies missing and shows 
no pattern of recognizable order as compared 
‘‘with the two spectrums to be described below. 
Fig. 13k has all signal harmonics present plus 
additional frequencies four tenths of the way in 
between successive harmonics. Fig. 13-1 also 
has all harmonics present with the interven- 
ing space equally divided by two more spectral 
lines. Figs. 13k and 1 also show some spectrum 
lines with amplitudes above that of the funda- 
mental. 
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CONCLUSION 

Providing time delay and memory in an 
analog computer opens up new classes of prob- 
lems which need to be solved. Simulations in- 
volving transport lags, correlation studies and 
communications problems require the formation 
of time delays. Iterative techniques for solving 
integral equations require analog function stor- 
age. 

The use of a magnetostrictive delay line for 
the storage medium is attractive because of the 
success this technique has had in digital com- 
puters. However, information must be intro- 
duced into this line in a digital manner. Al- 
though PCM is the most efficient method of 
digitizing, its equipment complexity and high 
cost make it unattractive to small computer in- 
stallations. Delta sigma modulation, which en- 
codes an analog signal into a digital code, is an 
inexpensive technique which has excellent line- 
arity and good noise characteristics. Using this 
method of modulation in conjunction with a 
magnetostrictive delay line, 5 to 10 milliseconds 
delays may be obtained at 2 me bit rates. The 
device will accommodate analog signals of up 
to 8 kc with phase shift below 2 degrees. The 
total dynamic error is within .2 per cent of half 
scale up to 1 kc. 

Magnetostrictive delay lines are limited at 
present to pulse rates of 5 me. However, ad- 
vances which will be made in high speed digital 
storage techniques can be utilized for analog 
storage since the analog signal is encoded into 
digital form. This system, due to its fixed spac- 
ing of narrow pulses, also offers the possibility 
of multiplexing. That is, one delay line with its 
read-write circuitry could be used to store sev- 
eral analog functions. This would, however, 
bring back the problem of the use of precise 
timing circuits in order to separate the pulses 
belonging to each analog function. 

APPENDIX 

Delta Sigma Modulator System Equations 

Consider the basic delta sigma modulator 
as shown in Fig. 1. The integrator output, €(t) 
is given by 


(A-1) 


where S(t) is the input signal and — F(t) is 
the flip-flop output. Consider Equation A-1 
evaluated at t = nT where T denotes the clock 
period and n an arbitrary integer. 

^ t (nT) = S (nT) — F (nT) ( A-2) 


— € (nT) may be approximated by 


hence, 


d[(n + l)T] — .(nT) 
T 


e[(n+l)T]— c(nT) 
T 


Now 


= S(nT) — F(nT) 
(A-3) 


F(nT) =1 


+E„ if £(nT)^0 
— E„if £(nT) <0 


so 


F(nT) = E„ 


provided that 


e(nT) 

e(nT) 


i^ = lfor£(nT)^0 
Rewriting Equation A-3, we find 
£[(n-l-l)T] — £(nT) = T S(nT)— E., 

|«(nT) I 
(A-5) 

£(n + 1)T + £(nT) | ~^] = ^ 

(A-6) 


A-6 is the basic equation describing the behavior 
of the modified delta modulation system. Un- 
fortunately, the presence of the absolute value 
term (which has as its counterpart the compa- 
rator in the actual system) does not allow an 
analysis of the modulator on a linear system 
basis. However, Equation A-6 may be solved 
for specific inputs either on a digital com- 
puter,!^ or by a step by step solution. 
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A COMPUTER-SIMULATED ON-LINE EXPERIMENT IN 
LEARNING CONTROL SYSTEMS 

J. D. Hill, G. J. McMurtry, and K. S. Fu 
Control and Information Systems Laboraiory 
School of Electrical Engineering 
Purdue University 
Lafayette, Indiana, 


INTRODUCTION 

The structure of a learning control system 
combines digital memory and logic circuitry 
with an adaptive system. Data obtained 
through adaptation is stored (remembered) and 
later utilized to improve the system perform- 
ance. The learning control system will operate 
satisfactorily under changing environmental 
conditions in which an adaptive system fails to 
improve the performance. The general learning 
system operation is outlined in this paper and a 
simple experimental system which illustrates 
the improved performance is discussed. The 
example system is a second order plant in which 
the damping ratio and undamped natural fre- 
quency are considered to be affected by the en- 
vironment in a piecewise constant manner. The 
system was simulated on a GEDA analog com- 
puter and the memory and logic functions were 
supplied by an IBM 1620 through IBM 1711 and 
1712 analog to digital and digital to analog 
equipment. 

LITERATURE SURVEY 

The words “learning system” and “self- 
organizing” appear frequently in the literature 
in the area of automata studies and artificial 
intelligence. Hawkins^ has given an excellent 
review on self-organizing systems as well as an 
extensive bibliography, while Minsky^ has re- 
viewed various aspects of artificial intelligence 


including hill-climbing, methods of classifica- 
tion for pattern recognition, reinforcement 
learning, problem solving machines and various 
other aspects of artificial intelligence. Nearly 
all of these systems utilize some form of mem- 
ory and do improve their performance with 
time. However, they require an off-line training 
period involving a human operator to judge 
whether or not their response to a given stim- 
ulus is satisfactory, i.e., they lack a built-in 
index of performance. 

A typical example of this is given by Gabor, 
Wilby and Woodcock® who have constructed a 
universal nonlinear filter which will organize 
its internal parameters in such a way as to act 
as an optimum filter, predictor, or simulator of 
an unknown mechanism, depending upon how 
it is trained. The important point is, however, 
that it must be trained by a human operator 
before it can function in the desired manner. 
This is typical of the general class of so-called 
learning or self-organizing pattern recognition 
machines. “Pandemonium,” “Perceptron,”® 
and “Adaline”* are other pattern recognition 
devices with self-organizing capabilities. All, 
however, require off-line training by human 
operators. 

Thus, though some of the techniques which 
have been developed for pattern recognition and 
automata theory may be useful when applied to 
the learning control problem, there has as yet 
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been very little direct application to control 
systems. 

In the hierarchy of automatic control systems, 
adaptive systems are more sophisticated than 
standard control systems. Truxa^ gives a good 
general review of adaptive control systems 
while Mishkin and Braun® discuss several spe- 
cific examples. The next advancement beyond 
adaptive systems appears to be in the area of 
learning control systems. 

Krug and Letskii* have suggested the use of 
a learning process for optimum control. In 
their paper they suggest that the optimum con- 
trol of slow but complex processes, such as 
chemical processes, might be found by a system- 
atic evaluation of input and output data and 
an index of performance. Probabilistic methods 
are suggested for organizing the memory for 
optimum search procedure. The system as out- 
lined is very general and a human operator is 
left to make the final decision as to whether the 
control found by the so-called automaton is 
satisfactory. Only cursory attention is given to 
the possibility of replacing the human operator. 

A considerable effort has been directed to- 
ward the investigation of learning in automatic 
control systems at the Control and Information 
Systems Laboratory, School of Electrical Engi- 
neering, Purdue University. An informal in- 
troduction to learning control systems has been 
given by Fu.^® One of the specific systems in- 
vestigated at Purdue University^* is discussed 
in this paper. 

Philosophy of Learning Control Systems 

The structure of a learning control system, 
as visualized at present, is distinguished from 
an adaptive system mainly in that in addition 
to a conventional adaptive system the learning 
system has memory and logic. Data obtained 
through adaptation is stored and later utilized 
to improve the system performance. Thus, 
whereas an adaptive system may take advan- 
tage only of its immediate past experience, a 
learning system is able to recall and use plant 
adjustment data obtained through adaptation 
while operating under similar environmental 
conditions in past time. An adaptive system 
will optimize a slowly time varying plant to a 
given index of performance (IP) , often through 


a hill-climbing technique, by modifying the con- 
troller or plant parameters. Thus, if a sufficient 
range of parameters is available, the adaptive 
system will become optimum for a given index 
of performance. A basic constraint is that the 
plant should vary slowly enough that the adap- 
tive loop is able to track the minimum of the 
index of performance and maintain relatively 
constant performance. 

In some instances, the plant parameters vary 
so quickly that the adaptive loop cannot main- 
tain optimum performance although some adap- 
tive action does take place. This is one type of 
situation in which the learning system pro- 
posed by the authors may be applicable. Learn- 
ing systems of the type discussed also yield 
superior performance in systems with slowly 
varying parameters which change suddenly but 
are relatively constant between changes. The 
learning system would be designed so that it 
would store in memory a quantized measure of 
the plant parameters which are varying or of 
the environment causing the variation, the best 
index of performance obtained by the adaptive 
system, and the corresponding corrections nec- 
essary to obtain this performance. 

It is important to realize that the hill-climb- 
ing technique used in the adaptive portion of 
the system may not converge to the minimum 
of the IP surface if the measurements of the IP 
are noisy. Thus all environmental parameters 
affecting the IP must be measured. Only if a 
separate hill-climb is performed for each com- 
bination of environmental parameters and all 
environmental parameters are considered will 
the IP surface be free of noise and the adaptive 
portion of the system operate properly. 

Thus when a previously occurring set of plant 
parameters is again encountered, the best 
learned corrections would be immediately set 
from memory and adaptation carried out from 
that point. The system would, of course, be de- 
signed so that the contents of the memory are 
continuously updated by storing the most recent 
best settings of the corrections. 

Assuming then, that the varying parameters 
may be quickly identified, and that the memory 
interrogation is faster than the adaptive action, 
the system performance, after sufficient operat- 
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Figure 1. Gfeneral Plant. 


ing time, would approach the optimum perform- 
ance. 

In the general learning situation the plant 
and plant parameters are shown in Fig. 1. In 
Fig. 1, e = (ei, 02 , . . ., Cn) is the environment 
vector which is considered to be measurable but 
uncontrolled. It includes all plant inputs and 
environmental factors which cannot be changed 
in order to improve the IP. d = (di, d 2 , . . . ,dp) 
is the plant parameter adjustment vector and 
includes plant inputs which may be changed in 
order to improve the IP. c = plant output vec- 
tor, and I is the plant index of performance. 

Now each component ei of the plant environ- 
ment vector e, corresponds to a separate en- 
vironmental or input variable such as tempera- 
ture, pressure, etc. 

For measurement purposes, each of these 
variables, ei, is assumed to be quantized into qi 
levels where the q/s for different values of i 
(1 = 1,..., n) are not necessarily equal. Then 
the number N of possible different environment 
vectors e is 

n 

N=][„ 

i=l 

e will thus be relabeled e = e'' ; r = 1 , 2 , . . . , 
N when referring to the r*** possible environ- 
ment. 

The index of performance, I, is considered to 
be a scalar function of e*", d and c, i.e., 

I = f (e"", d, c) 

where c = F(e'’, d). Thus, for a given e^, I has 
at most a single minimum as a function of d. I 
is then a p-dimensional hyper-surface in the 
adjustment parameters dj (j = 1, 2, . . ., p). 
There results, then, a p-dimensional hill-climb- 
ing problem corresponding to each different cL 


There are several methods of hill-climbing 
the d vector to the minimum of the IP surface, 
but the one chosen here is to hill-climb each 
component of d separately and sequentially, the 
best values of each of the p parameters being 
stored in memory corresponding to each e'', 
along with the corresponding direction of in- 
crease or decrease in each parameter of d and 
the lowest value of I. The stored adjustment 
vector corresponding to e"" is labeled (Su- 
perscript M indicates a stored or memorized 
parameter.) 


The operation of the system is then as fol- 
lows. Upon recognition of the occurrence of e\ 
d is set to d''“ from a search of memory, and 
hill-climbing proceeds. The computer program 
should be arranged to facilitate a fast search of 
memory for previously occurring eL This may 
be implemented by programming the computer 
to search the most frequently occurring e'^’s 
first. The program must also be capable of set- 
ting up new storage locations for e^’s which 
have not previously occurred. 


Now let us consider that a particular 
occurs, given that it has previously occurred k 
times. Immediately (k) is set from memory 
and parameter dj is to be adjusted, where : 


k — a p 
j>0 
fO 0 < k 
1 P + 1 
etc. 


P 

k 


2p 


' These condi- 
tions are ap- 
plied to indicate 
sequential na- 
ture of adjust- 
ment of dj’s as 
mentioned pre- 
, viously. 


Then d/ (k 4 - 1) is adjusted to 

d/(k + l) =d/“(k) + s/«(k) Ad, 


where 

Ad, = increment of d, 

s,'“(k) =±1 (stored direction in which to 
increment d,'’“ (k) ) 


Let us now consider the logic required to pro- 
gram the hill-climbing operation and the modi- 
fication of computer memory. Define 


mr 


, 1 , , _ flfor (P(k-l-l)— P“(k)) ^01 

-t0for(P(k-f D— P«(k)) >0J 


where 

P(k-hl) =I(e^d'(k+l) 
P«(k) =I(c%d^^(k)) 
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The memorized compensation parameter then 
becomes 

d/M(k + l) =d,'“(k) + 
m/(k + 1) sj'»*(k) Ad, 

The memorized compensation vector is given by 
d'“(k + 1) = d'"(k) — d, (k) + 
d,'“(k + 1) 

where 

dj'«(k) = (Oi, O 2 , . . . 0j-„ d,'«(k), 

• • • Op) 

(k + 1) = (0„ O 2 , . . . 0,-1. d,'« (k + 1) , 
Oj+i, . . . Op) 

The memorized value of the index of perform- 
ance becomes 

FM(k + 1) = I'“(k) + m,'(k + 1) 
(I'(k-|- 1) — pM(k)) 

and the stored direction in which to increment 
d/^ (k -f- 1) is given by 

s,^»i(k-|- 1) = — sgn [(p(k-fl) — 
P^(k))(d,'(k 1) _d/^(k))] 

In the above development, all environmental 


parameters affecting the system are assumed 
to be measurable and measurement noise is 
neglected. All environmental parameters 6' 
are assumed to be constant during one hill- 
climbing step. In the experimental example the 
same assumptions were made except that e** 
was assumed constant for four hill-climbing 
steps. 

Experimental System 

A learning system of the type discussed 
above with second order plant has been simu- 
lated on the IBM 1710-GEDA hybrid computer 
installation. The system was assumed to be 
subjected to a pseudo-random sequence of en- 
vironmental conditions which change the damp- 
ing coefficient and undamped natural frequency 
of the system at discrete intervals. The en- 
vironment was changed sufficiently often that 
purely adaptive action (i.e, hill-climbing on 
the IP surface) could not optimize the system 
during the period of constant environment. 



Figure 2. Experimental Learning System. 
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The system was subjected to a fixed amplitude 
square wave input in order to facilitate the 
computation of an index of performance of the 
quadratic type. In the plant, as shown in Fig. 
2, ei and 02 are environmental parameters 
which are constrained to remain constant over 
two periods of the square wave input. Thus, at 
each occurrence of an (ei, 02) pair four compu- 
tations of the IP are carried out, one at each 
transition of the square wave. After each com- 
putation of the IP, the compensation param- 
eters di and d2 are adjusted by a two-dimen- 
sional hill-climbing technique described above 
in order to minimize the performance index. 
Memory is used to store the current best values 
of di and d2 for a given set of (ei, 02) values, 
as well as the directions in which di and d2 
were being adjusted, and the best value of the 
IP found previously. Thus when a previously 
occurring (ei, 02) pair reoccurs, the best values 
of di and d2 are set from memory and the best 
direction to increment di and d2 is known. 
Adaptation then proceeds and the latest values 
of the best di and d2, direction and IP replace 
the old values in memory. 

For the particular system under investiga- 
tion, ei and 62 were each allowed to take on five 
different values, i.e., qi z= qg = 5. Thus twenty- 
five different (ei, e2) combinations were possi- 
ble. In order to simulate the limitations on 
computer memory capacity, only the sixteen 
most probable (ei, e2) combinations were al- 
lowed to have corresponding di, d2, etc. data 
stored with them. In this system no compensa- 
tion or hill-climbing took place on the nine least 
probable (ei, 02) combinations. Probabilities 
were computed by counting occurrences of (ei, 
62) pairs and a continuous check was made for 
the 16 most probable. In the hill-climbing tech- 
nique, the increments used for changing di and 
d2 were of fixed size and were not reduced even 
after learning was essentially complete. 

The changes in ei and e2 were generated by 
storing in memory a pseudo-random sequence 
of (ei, 02) pairs two hundred long. The (ei, e2) 
pairs were set from memory on the digital pots 
just prior to the square wave transition and the 
sequence of (ei, 02) pairs repeated after each 
two hundred values (fifty different (ei, eo) 
pairs since each pair was repeated at least four 
times in a row) . 


The performance index (IP) used was of the 
form 

T 

IP = I = J (71 -f- 72 X2^ -f- 7s U") dt 

0 

where e, X2, u are as defined in Fig. 2, and 71, 72, 
73,may be changed as desired. In this example, 
the parameter ei was allowed to take on the 
values 4, 5, 6, 7, or 8 and 02 to have the values 
20, 30, 40, 50 or 60. 

An outline of the digital computer program 
and flow diagram for the system under consid- 
eration are given in Appendix A. 


Discussion of Results 


An investigation of the IP surface, for the 
particular system and performance indices 
under investigation, was carried out. The sys- 
tem may be redrawn as shown in Fig. 3 if we 
let a = ei -f di and b = e2 -|- d2. Let the input 
R (s) be a step input of amplitude A. Then 


R(s) 


A 

s 


t(s) = 


A(s + a) 
s^ -f- as -f b 


X2(S) 


A 

s® -H as -f b 


u(s) 


As 

s* -|- as -|- b 


For an index of i)erformance given by 

00 

I = j (yif^ 72 X2^ -f- 73 u^) dt 
0 



Figure 3. Simplified System Block Diagram. 
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For the performance index used in this exam- 
ple 

yi = 0 , 72 = 1, ys = 1 

Thus 

A^[b + 1] _ k[b + 1] 

2ab ab 

The constant IP contours for this perform- 
ance index are shown in Fig. 4 for k = 1.0. 
We note that for this case, the index of per- 
formance is very insensitive to e 2 di = b but 
very sensitive to ei -f di = a. This theoretical 
conclusion was verified experimentally as may 
be seen from the trajectories plotted in Fig. 4 
for several (ej, eo) values. The system trajec- 
tories required approximately two and one- 
half hours operation to reach the points indi- 
cated. The hill-climbing was not completed at 
this stage but the time was sufficient to illus- 
trate that the system was learning. In this 



Figure 4. Index of Performance for I = 


Parseval’s theorem yields 

joo 

1=^/ {yi[*(s) «(— s)] + 

—joo 

y 2 [X 2 (s) X 2 ( s)] + yaCuCs) u( — s)]} ds 

After some manipulation, the integrand be- 
comes 

A^(yi + ys) 

L N yi + ya J L ' N yi + y-s J 

(s^ -j- as -|- b) (s* — as 4- b) 

The IP may be evaluated from standard tables^^ 
to yield 

J A^Cbfyi -|- yii) -j- y2 + yi] 

^ - 2ab 

Then for given yi, y^, and ys, contours of IP 
may be plotted in the (a, b) plane. 
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Figure 5. Learning System Performance. 


length of time the IP decreased by a factor of 
approximately six. The improvement in IP for 
each different (ei, e 2 ) pair may also be seen 
from Fig. 5, which was plotted from computer 
type-out data as discussed in App)endix A. Note 
the IP scale in Fig. 5 was not normalized as are 
the contours in Fig. 4. It is noted that this 


particular performance index becomes zero as 
a 00 , b ^ 00 , and does not have a particular 
minimum. Thus after sufficient operating time, 
the d] and d^. adjustments would simply reach 
the maximum values allowed for in the system 
design (100 and 1000 respectively). It appears 
that the IP is zero for b = — 1 but this value 



322 


PROCEEDINGS->-SPRING JOINT COMPUTER CONFERENCE, 1964 


of b invalidates the derivation for the IP since 
ParsevaFs theorem is valid only for asymp- 
totically stable systems. 

Fig. 5 is a plot of IP versus (ej, eo) pairs for 
the index of performance investigated. Four 
points are plotted for each (ei, eo) pair corre- 
sponding to four different times in the learning 
process. Level A represents the IP measured 
before compensation. The particular sequence 
of (ei, e 2 ) pairs given in this test was such that 
the first twenty-five pairs seen by the system 
were all different (from left to right as plotted 
in Fig. 5). After all twenty-five pairs occurred 
once, the order of occurrence was no longer the 
same. Level B represents the IP for each pair 
after the first four adaptive steps for each 
(ei, e 2 ). The first sixteen (ei, e 2 ) pairs seen by 
the system were compensated initially and in- 
formation was stored. The last nine, in order 
of first appearance, had no information initially 
stored, and thus showed no improvement. 

Level C represents the IP after approxi- 
mately one hour of running. Here it is seen 
that the system had determined the sixteen 
(ei, eo) pairs occurring most frequently (most 
probable) and had continuously learned on 
them. The nine least probable pairs had not 
improved from their initial values. 

Level D represents the IP at the conclusion 
of the test for each pair. Here again, of the 
sixteen most probable, the ones with highest 
probability learned faster. Of course adapta- 
tion would normally be carried out on all (ei, 
02 ) pairs and not just on the 16 most probable. 

The learning system then operates as an 
adaptive system when a given (ei, e 2 ) pair or 
environment first occurs, but has the capability 
of utilizing past information about the best 
compensation parameter setting when a previ- 
ously occurring environment reoccurs. In this 
case, given sufficient operating time, the learn- 
ing system is capable of reducing the index of 
performance to the minimum possible value for 
the 16 most probable (ei, e 2 ) pairs (environ- 
mental conditions) even though the environ- 
ment is changing so rapidly that an ordinary 
adaptive system would fail to improve the index 
of performance significantly. 


Concluding Remarks 

Several particular problems currently under 
investigation are the application of stochastic 
approximation techniques^^^’^^’^^"* to hill-climbing 
when the IP surface is disturbed by noise due 
to unmeasurable environmental parameters and 
the application of pattern recognition tech- 
niques to measurements of the IP surface in 
order to increase the rate of learning. Present 
studies are also concerned with methods of 
quantizing the environment parameters to yield 
efficient learning operation subject to the con- 
straint of finite computer memory capacity. 

The principle of learning outlined in this 
paper, that is, partial hill-climbing of the IP 
surface at each occurrence of a particular en- 
vironment and memorization of the best com- 
pensation found for that environment, seems 
to be a relatively simple idea ; and yet, as illus- 
trated by the example, it yields great improve- 
ment in the system performance over that for 
a simple adaptive system. It is felt that this 
concept of learning control is particularly ap- 
plicable to process control. 
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APPENDIX 

Description of the Computer Program 

The flow diagrams shown in Figs. 6 , 7, and 8 
illustrate the system logic and timing. A more 
detailed examination follows : 

1 . The (ei, e 2 ) values were punched on cards 
and stored in memory at the beginning of the 
program. The program did not take advantage 
of this fact in determining the probability of 
each (ei, e 2 ) pair. 

2. The type-out of results was controlled 
manually by the computer operator by means 
of a Program Switch. If the switch was on and 
a cycle of 200 (ei, ej) values had been com- 
pleted, the type-out was performed, except at 
the beginning of the program where type-out 
was performed after the first 100 (ei, 62 ) values. 
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Figures. Flow Diagram (Part III). 

3. When 10^ (ei, eo) values had occurred, the 
total number of recorded occurrences of each 
(ei, e2) pair was divided by ten and a counter 
was reset to count the next 10*^ values. Then 
each new occurrence of a particular (ei, 02) 
pair caused the number of recorded occurrences 
for that (ei, 02) to be increased by one. Thus 
the more recent occurrences were weighted 
more heavily. 

4. When a new pair, (ei, eo), was selected, it 
was compared to the previous pair, (ei, e2)i-i. 
If they were the same, the search procedure for 
(ei, 02) and its information was bypassed, since 
the location of information for (ei, 02)1 was the 
same as for (ej, 02)1-1. As shown in 8 below, an 
indicator was placed in memory for each (ci, 
02) pair showing which dj was operated on last. 
Then the program would operate on the other 
dj when that (ei, 02) pair occurred again. In 
order for the program to operate on the same dj 
when an (ei, e^) pair occurred twice or more in 
a row, the indicator was changed when (ei, 02)1 
was determined to be the same as (ei, e2)i-i. 


5. If (ei, e2)i ^ (ei, e2)i-i, then memory was 
searched to see if ei had ever occurred previ- 
ously. Next e2 was checked for previous ocur- 
rence. Then (ei, e2)i was assigned an address 
where information was stored concerning its 
probability and location of index of perform- 
ance and compensation information. The prob- 
ability was incremented next (actually the num- 
ber of recorded occurrences for (ei, 02)1 was 
increased by one) . 

6. At this point memory was organized so 
that only the sixteen most frequent (ei, e2) 
.pairs had compensation information stored. If 

(61, e2)i had no information stored, then di and 
d2 were set to zero. 

7. The following information was stored for 
the sixteen most probable (ei, e2) pairs: 

a) whether dj or d2 was operated on most 
recently 

b) the value of dj (j —1^ 2) that resulted 
in the best (lowest) IP value 

c) the slope (direction of change) for dj 

(j = 1, 2) 

d) the sign of dj ( j =: 1, 2) 

e) the best previous value of IP. 

8 . If information was stored for (ei, 02)1, the 
program next determined from an indicator 
which dj was operated on last. Assume this was 
di. The stored value of di was then set, and 
operations commenced on d2. The indicator was 
then set to show that the program acted on d2 
last (see 4 above). Next, the slope was deter- 
mined. 

a) If the slopd was positive, the program 
added the standard increment. Ad, to d2 and 
the sum was compared with 9999, since this 
was the maximum setting on the digital po- 
tentiometer. If the sum was greater than 
9999, then do was set to 9999. If not d2 was 
set equal to the sum in question. 

b) If the slope was negative, and d2 was 
greater than Ad, then Ad was subtracted 
from do. If d2 was less than Ad, the sign of 
ds was changed and the magnitude of d2 was 
not changed. The sign of dj was made nega- 
tive or positive, respectively, by switching an 
amplifier in or out of the feedback loop (see 
Fig. 2). If the sign was negative the ampli- 
fier was in, thus yielding a subtractive cor- 
rection. 
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c) Note that the previous values of di and 
do were left in memory. The new dj were 
stored later only if the new IP to be meas- 
ured was lower than the previously stored 
value of the IP (see 13 below) . 

9. The proper values of ei, 62 , di and d 2 were 
then set on the analog computer and the IP 
integrator was set for integration. 

10. The step input was generated next, using 
a comparator on the GEDA. A delay followed, 
during which the IP was computed. At the end 
of the delay, the IP was measured and the IP 
integrator was disconnected and reset to zero. 

11. If no information was stored for (ei, 02 ) 1 , 
the system had completed the cycle and the next 
(ei, e 2 ) pair was obtained. 

12. When information was stored for (ei, 
02 ) 1 , and if the measured IP was greater than 
the stored best previous IP, then the slope indi- 
cator was changed on dj, so that dj would be 
incremented in the opposite direction the next 
time the same (ei, e 2 ) pair occurred. If the 
slope of dj was changed two consecutive times, 
then the present measured value of IP was 
stored and dj and d 2 were not changed in mem- 
ory. This procedure was implemented to allow 
for the possibility that an erroneously low value 
of IP was stored due to any random noise pulses 
introduced into the system at some previous 
time. If the slope of dj had not changed twice 
consecutively, only the slope indicator was 
changed in memory and the system had com- 
pleted the cycle. 

13. If the measured value of IP was lower 
than the stored value, then the measured value, 
along with the present di and da values, were 
placed in memory and the cycle was complete. 
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A HEURISTIC PROGRAM TO SOLVE 
GEOMETRIC-ANALOGY PROBLEMS 

Thorms G. Evans 

Air Force Cambridge Research Laboratories (OAR) 
Bedford, Massachusetts 


INTRODUCTION 

The purpose of this paper is to describe a 
program now in existence which is capable of 
solving a wide class of the so-called 'geometric- 
analogy' problems frequently encountered on 
intelligence tests. Each member of this class of 
problems consists of a set of labeled line draw- 
ings. The task to be performed can be concisely 
described by the question : 'figure A is to figure 
B as figure C is to which of the given answer 
figures?' For example, given the problem illus- 


A 


A 



I 


2 3 4 



□ 



A 


Figure 1. 


trated as Fig. 1, the geometric-analogy program 
(which we shall subsequently call ANALOGY, 
for brevity) selected the problem figure labeled 
4 as its answer. It seems safe to say that most 
people would agree with ANALOGY'S answer 
to this problem (which, incidentally, is taken 
from the 1942 edition of the Psychological Test 


for College Freshmen of the American Council 
on Education). Furthermore, if one were re- 
quired to make explicit the reasoning by which 
he arrived at his answer, prospects are good 
that the results would correspond closely to the 
description of its 'reasoning' produced by 

ANALOGY. 

At this point, a large number of questions 
might reasonably be asked by the reader. Four, 
in particular, are : 

(i) Why were problems of this type chosen 
as subject matter? 

(ii) How does ANALOGY go about solving 
these problems? 

(iii) How competent is ANALOGY at its sub- 
ject matter, especially in comparison to human 
performance ? 

(iv) What has been learned in the construc- 
tion of ANALOGY and what implications might 
this study have for the further development of 
problem-solving programs in general ? 

The remainder of this paper constitutes an 
attempt to answer these questions in some de- 
tail. We first deal with a variety of motivations 
for this investigation and attempt to place it in 
the context of other work in related areas. Next 
we turn to detailed consideration of the problem 
type and of the mechanism of the ANALOGY 
program. Finally, we present some answers to 
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the remaining two questions raised above. (A 
more detailed discussion of all these issues can 
be found in Ref. 1). 

Motivations and Background 

In our opinion ample general justification for 
the development and study of large heuristic 
problem-solving programs has been provided 
(both through argument and through example) 
by previous workers in this area. We shall not 
attempt to add to it. Given that one is inter- 
ested in the construction of such programs, a 
number of reasons can be advanced for the 
choice of geometric-analogy problems as a suit- 
able subject matter. Some of these are ; 

(i) Problems of this type require elaborate 
processing of complex line drawings : in par- 
ticular, they require an analysis of each picture 
into parts and the determination and use of 
various relationships among these parts. This 
is an interesting problem per se and one which 
can reasonably be expected to be of great prac- 
tical importance in the near future. 

(ii) The form of the problems requires one to 
find a transformation that takes figure A into 
figure B and takes figure C into exactly one of 
the answer figures. This situation provides a 
natural opportunity for trying out certain ideas 
about the use of explicit internal 'descriptions' 
(here, of both figures and transformations) in 
a problem-solving program. Furthermore, more 
speculatively, it presents an interesting para- 
digm of 'reasoning by analogy,' a capacity 
which may play a large role in far more sophis- 
ticated problem-solving programs in the future. 
(In Section 5 we discuss the possible relevance 
of ANALOGY to the introduction into problem- 
solving programs of more powerful learning 
mechanisms than have yet been achieved.) 

(iii) Problems of this type are widely re- 
garded as requiring a considerable degree of 
intelligence for their solution and in fact are 
used as a touchstone of intelligence in various 
general intelligence tests used for college ad- 
mission and other purposes. This suggests a 
non-trivial aspect of any attempt to mechanize 
their solution. 

We shall now attempt very briefly to place 
ANALOGY in the context of earlier work in re- 


lated areas. Two aspects of ANALOGY must 
be considered: 

(i) ANALOGY contains a substantial amount 
of machinery for the processing of representa- 
tions of line drawings, including decomposition 
into subfigures, calculation of relations between 
figures, and 'pattern-matching' computations. 
Thus we must relate it to other work in picture 
processing and pattern recognition. 

(ii) ANALOGY is a complex heuristic prob- 
lem-solving program, containing an elaborate 
mechanism for finding and 'generalizing' trans- 
formation rules. Thus we must relate it to other 
work on the development of problem-solving 
programs. 

We turn first to the picture-processing aspect. 
The essential feature of the treatment of line 
drawings by ANALOGY is the construction, 
from relatively primitive input descriptions, of 
more 'abstract' descriptions of the problem fig- 
ures in a form suitable for input to the rule- 
finding program. The fundamental program- 
ming technique underlying this method is the 
use of a list-processing language, in this case 
LISP,^ *"* to represent and process the figures in 
question. Work in picture processing, for pat- 
tern-recognition purposes, involving some ele- 
ments of description, is found in Grimsdale et 
aZ.,^ Marill et aZ.,"‘ and Sherman,^* among others. 
Sutherland^ and Roberts^ have used, for quite 
different purposes, internal representations of 
line drawings similar in some respects to those 
used in ANALOGY. Kirsch^ has worked with 
complex line drawings primarily as a vehicle 
for programs involving the analysis of English- 
language sentences pertaining to such pictures. 
Modes and Canaday" have used LISP expres- 
sions for figure description in much the same 
way that we have, though the development of 
machinery for manipulating such descriptions 
was, of necessity, carried much further in 
ANALOGY. Evidently the first advocacy of 
‘scene description' ideas (for use in pattern 
recognition) occurs in Minsky.'- 

To place ANALOGY with respect to other 
work with problem-solving programs, we shall 
simply list a number of developments in the con- 
struction of problem-solving programs which 
have influenced, in a general way, our approach 
to the design of ANALOGY. These include LT 
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(the Logic Theorist) and, more recently, GPS 
(the General Problem Solver) of Newell, 
Simon, and Shaw, the plane-geometry theorem- 
pro ver**"’ of Gelernter and Rochester, and 
SAINT, the formal integration program of 
Slagle 

Summary of the Solution Process, 
with Example 

To exhibit as clearly as possible the entire 
process carried out by ANALOGY, we now 
sketch this process, then examine its operation 
on an example. The sample problem we shall be 
considering is shown as Fig. 2 (where the Pi's 
are not part of the problem figures but labels 
keying the corresponding parts of the figures to 



B 



C 



12 3 4 5 



Figure 2. 


expressions we shall give below). Before treat- 
ing the example, we shall summarize the entire 
solution process. Given a problem such as that 
above, ANALOGY proceeds as follows: First, 
the input descriptions of the figures are read. 
Currently these descriptions, given as LISP ex- 
pressions in a format to be illustrated below, 
are hand-made; however, they could well be 
mechanically generated from scanner or light- 
pen input by a relatively straightforward, quite 
‘unintelligent' program embodying line-tracing 
techniques already described in the literature. 
The descriptions represent the figures in terms 
of straight line segments and arcs of circles tto 
any desired accuracy, at the cost of longer and 
longer expressions). Examples of the descrip- 
tions are given below. 

The first step taken by ANALOGY is to de- 
compose each problem figure into ‘objects' (sub- 
figures) . The decomposition program originally 
written, which was sufficient to handle many 



cases, including the example to be discussed be- 
low, was quite simple. It merely separated a 
problem figure into its connected subfigures ; 
e.g., figure A of the above example consists of 
the three objects labeled PI, P2, and P3. It 
later became desirable to have a more sophisti- 



Figure 3b. 


cated decomposition program with, in particu- 
lar, the capability of separating overlapped 
objects on appropriate cues. For example, sup- 
pose problem figure A is as in Fig. 3a and figure 
B is as in Fig. 3b. The decomposition program 
should be able to separate the single object of 
figure A into the triangle and rectangle on the 
basis that they appear in figure B, from which 
point the remaining mechanism of parts I and 
II could proceed with the problem. While a de- 
composition program of the full generality de- 
sirable has not yet been constructed, the most 
recent version of the program is capable, in par- 
ticular, of finding all occurrences of an arbi- 
trary simple closed figure x in an arbitrary 
connected figure y ; for each such occurrence the 
program can, if required, separate y into two 
objects: that occurrence of x and the rest of y 
(described in the standard figure format — note 
that this ‘editing' can be rather complex: con- 
nected figures can be split into non-connected 
parts, etc.). 
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The type of decomposition illustrated above 
might be called 'environmental/ in that, e.g., 
figure A is separated into subfigures on the in- 
formation that these subfigures are present, 
already separated, in figure B. An interesting 
extension to the present part I of ANALOGY 
might be to incorporate some form of 'intrinsic’ 



Figure 4a. 

decomposition in which 'most plausible’ decom- 
positions are generated according to Gestalt- 
like criteria of 'good figure.’ Such an extension 
could widen the problem-solving scope of 
ANALOGY considerably to include many cases 
where the appropriate subfigures do not appear 
already 'decomposed’ among the problem fig- 
ures. For example, suppose problem figures A 
and B are as shown in Figs. 4a and 4b, respec- 



o 

Figure 4b. 


tively. A decomposition into the square, trian- 
gle, and circle seems necessary to state a rea- 
sonable transformation rule. This example, in- 
cidentally, illustrates one potentially useful 
'intrinsic’ decomposition heuristic : roughly, 
choose decompositions into subfigures which 
have as much internal symmetry (in some pre- 
cise sense) as possible. 

Next, the ‘objects’ generated from the decom- 
position process are given to a routine which 
calculates a specified set of properties of these 
objects and relations among them. The program 
is designed so that this set can be changed 
easily. As a sample of a relation-calculating 
subroutine, we cite one that calculates, for fig- 
ure A of our example, that the object labeled 
P2 lies inside that labeled P3 and generates a 
corresponding expression (INSIDE P2 P3) to 
be added to the part I output description of fig- 
ure A. The method used involves calculating all 
intersections with P3 of a line segment drawm 
from a point on P2 to the edge of the field (all 
figures are considered as drawn on a unit 
square) . In this case P2 lies inside P3 since the 
number of such intersections is odd, namely one 
(and P3 is known to be a simple closed curve — 
if it were not, the calculation just described 
would be performed for each closed curve con- 
tained in P3). To do this, a substantial reper- 
toire of 'analytic geometry’ routines is required 
for part I, to determine, for example, intersec- 
tions of straight line segments and arcs of 
circles in all cases and combinations. Other re- 
lation routines available in part I calculate, for 
example, that in figure A of our example PI 
is above P2 and P3 and in figure B that P4 is to 
the left of P5. 

The principal business of part I, aside from 
decomposition and the property and relation 
calculations, is a set of 'similarity’ calculations. 
Here, part I determines, for each appropriate 
pair of objects, all members from a certain class 
T of transformations which carry one object of 
the pair into the other. The elements of T are 
compositions of Euclidean similarity transfor- 
mations (rotation and uniform scale change) 
with horizontal and vertical reflections. Given 
descriptions of virtually any pair of arbitrary 
line-drawings x and y, the routines of part I 
will calculate the parameters of all instances of 
transformations from T that 'map’ x into y. 
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More precisely, an acceptable 'map’ is a member 
of T for which T(x) is congruent to y up to 
certain metric tolerances which are parameters 
in the corresponding programs. 

This routine is, in effect, a pattern-recogni- 
tion program with built-in invariance under 
scale changes, rotations, and certain types of 
reflections. It consists essentially of a topologi- 
cal matching process, with metric comparisons 
being made between pairs of lines selected by 
the topological process. In Ref. 6 Sherman in- 
troduced some topological classification into a 
sequential decision tree program for the recog- 
nition of hand-printed letters, but the notfon of 
systematically using the topological informa- 
tion to determine which metric comparisons ^re 
to be made seems to be new. This type of organ- 
ization for pattern recognition has its own ad- 
vantages (e.g., flexibility — ^the metric parts can 
be changed easily with no effect on the overall 
structure) and difficulties (e.g., sensitivity to 
metrically small changes in a figure which affect 
the connectivity — but this sensitivity can be 
largely removed by suitable pre-processing). 
Incidentally, it may be worth noting that if we 
suppress the metric comparisons entirely we 
get a general, and reasonably efficient, topo- 
logical equivalence algorithm for graphs (net- 
works) . 

The set of techniques we have just been 
describing, based on the use of a list-processing 
language to perform processing of line drawings 
by manipulating their list-structured descrip- 
tions, is by no means limited in applicability to 
the uses to which we have put it in part I of 
ANALOGY. To the contrary, it is our view that 
the representation of line drawings used here 
and the corresponding processing routines form 
a suitable basis for the development of a quite 
powerful 'line-drawing-manipulation language’ 
with potential usefulness in a wide variety of 
applications. Regardless of whether the present 
investigation turns out to have a measurable 
influence on the art of designing problem-solv- 
ing programs, it seems probable that the prin- 
cipal short-range contribution of ANALOGY 
is in the picture-processing by-products just 
described. (Incidentally, these techniques were 
discussed briefly from an ANALOGY-independ- 
ent point of view in Ref. 17.) 


After the similarity information is computed 
for every required pair of objects, both within 
a problem figure and between figures, this in- 
formation, together with the decomposition and 
property and relation information, is punched 
out on cards in a standard format for input to 
part II. (For a typical set of figures, the total 
output of part I, punched at up to 72 columns/ 
card, might come to 15 to 20 cards.) 

Part II is given these cards as input. Its 
final output is either the number of the solution 
figure or a statement that it failed to find an 
answer. Its first step is to generate a rule (or, 
more frequently, a number of alternate rules) 
transforming figure A into figure B. Such a 
rule specifies how the objects of figure A are 
removed, added to, or altered in their properties 
and their relations to other objects to generate 
figure B. Once this set of rule possibilities has 
been generated, the next task is to ‘generalize’ 
each rule just enough so that the resulting rules 
still take figure A into figure B and now take 
figure C into exactly one of the answer figures. 
More precisely, for each ‘figure A ^ figure B’ 
rule and for each answer figure, part II at- 
tempts to construct a ‘common generalization’ 
rule which both takes figure A into figure B and 
figure C into the answer figure in question. This 
process may produce a number of rules, some 
very weak in that virtually all the distinguish- 
ing detail has been ‘washed out’ by ‘generaliza- 
tion.’ Hence it is necessary at this point to pick 
the ‘strongest’ rule by some means. This entire 
process requires a complex mechanism for ma- 
nipulating and testing the rules and deciding 
which of the several rule candidates, the results 
of different initial rules or of different ‘general- 
izations,’ is to be chosen. 

The principal method embodied in part II at 
present is able to deal quite generally with prob- 
lems in which the numbers of parts added, re- 
moved, and matched in taking figure A into 
figure B are the same as the numbers of parts 
added, removed, and matched, respectively, in 
taking figure C into the answer figure. A sub- 
stantial majority of the questions on the tests 
we have used are of this type, as is our present 
example; virtually all would be under a suffi- 
ciently elaborate decomposition process in part 
I ; this restriction still permits a wide variety 
transformation rules. It should be mentioned 
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that the methods of part II have been kept 
subject-matter free; no use is made of any geo- 
metric meaning of the properties and relations 
appearing in the input to part II. 

The more detailed workings of both parts I 
and II are best introduced through examining 
the process sketched above at work on our ex- 
ample. To convey some further feeling for the 
nature of the input to part I, we exhibit part of 
it, namely, the input description of figure A. 
The LISP expressions look like ; 

( 

(DOT (0.4 . 0.8) ) 

(SCO ( (0.3 . 0.2) 0.0 (0.7 . 0.2) 0.0 (0.5 . 
0.7) 0.0 (0.3. 0.2))) 

(SCO ( (0.4 . 0.3) 0.0 (0.6 . 0.3) 0.0 (0.6 . 
0.4) 0.0 (0.4 . 0.4) 0.0 (0.4 . 0.3))) 

) 

The first line above corresponds to the dot (at 
coordinates x = 0.4 and y = 0.8 on the unit 
square; the coordinate pairs in the other ex- 
pressions are interpreted analogously). The 
next two lines correspond to the triangle (SCC 
stands for simple closed curve. All connected 
figures are divided into three classes — dots 
(DOT), simple closed curves (SCC), and all 
the rest (REG). This is solely for reasons of 
programming convenience ; no other use is made 
of this three-way classification). Each non- 
connected figure is represented simply by a list 
of descriptions of its connected parts. 

A curve (which may consist of an arbitrary 
sequence of elements chosen from straight line 
segments and arcs of circles) is represented by 
a list in which coordinate pairs alternate with 
the curvatures of the line elements between (all 
curvatures are zero here since the lines in ques- 
tion are all straight) . Similarly, the next two 
lines above correspond to the rectangle ; the en- 
tire description of figure A is then a list of the 
descriptions of these three parts. The format 
corresponding to the non-SCC figures like the 
Z-shaped subfigure of figure C is similar though 
somewhat more complex ; it looks like : 

(REG ( (VI V2 (0.0 (0.55 . 0.5) 0.0 (0.45 . 
0.3) 0.0)) 

(V2 VI (0.0 (0.45 . 0.3) 0.0 (0.55 . 0.5) 

0 . 0 )))) 


where VI and V2 are the two vertices (here, 
endpoints) of the figure. The coordinates of VI 
and V2 are given to part I in a separate list. 
They are VI = (0.45 . 0.5), V2 = (0.55 . 0.3). 
Here, the top-level list describes the connectiv- 
ity by stating which vertices are connected to 
which and how often — sublists describe in de- 
tail the curves making these connections. (By 
vertex we mean either an endpoint of a curve 
or a point at which three or more curves come 
together.) The complete details of the input 
format are given in Ref. 1, along with many 
examples. 

When the input shown above corresponding 
to problem figure A and the corresponding 
inputs for the other seven figures are processed, 
the output from part I is, in its entirety, the ten 
LISP expressions shown below. For brevity, 
all similarity information concerning non-null 
reflections has been deleted. Also, we have re- 
placed the actual arbitrary symbols generated 
internally by ANALOGY as names for the parts 
found by the decomposition program by the 
names PI, P2, etc., which appear as labels on 
our example figures above. The ten output ex- 
pressions are: 

(1) ((PI P2 P3) . ((INSIDE P2 P3) 

(ABOVE PI P3) (ABOVE PI P2) ) ) 

(2) ((P4 P5) . ((LEFT P4 P5))) 

(3) ((P6 P7 P8) . ((INSIDE P7 P6) 

(ABOVE P8 P6) (ABOVE P8 
P7))) 

(4) ((P2 P4 (((1.0 . 0.0) . (N.N)) ((1.0 . 

3.14) . (N.N)))) (P3 P5 (((1.0 . 

0.0) . (N.N))))) 

(5) ((PI P8 (((1.0 . 0.0) . (N.N))))) 

(6) NIL 

(7) ( (P9 PIO Pll) (P12 P13) (P14 P15) 

(P16 P17) (P18) ) 

(8) ( ((INSIDE PIO P9) ABOVE Pll P9) 

(ABOVE Pll PIO)) ((LEFT P12 

P13)) ((INSIDE P15 P14)) 

((ABOVE P17 P16)) NIL) 

(9) ( ((P6P9 (((1.0. 0.0) . (N.N)))) (P7 

PIO (((1.0 . 0.0) . (N.N)) ((1.0 . 

—3.14) . (N.N)))) (P8P11 (((1.0 

. 0.0) . (N.N))))) 
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((P6 P13 (((1.0 . 0.0) . (N.N)))) 
(P7 P12 (((1.0 . 0.0) . (N.N)) 
((1.0 . -3.14) . (N.N))))) 

((P6 P14 (((1.0 . 0.0) . (N.N)))) 
(P7 P15 (((1.0 . 0.0) . (N.N)) 
((1.0 . —3.14) . (N.N))))) 

((P6 P16 (((1.0 . 0.0) . (N.N)))) 
(P8 P17 (((1.0 . 0.0) . (N.N))))) 

( (P7 P18 ( ( (1.0 . 0.0) . (N.N) ) ( (1.0 . 
-3.14) . (N.N.))))) ) 

(10) ( ( ((PI Pll (((1.0 . 0.0) . (N.N))))) 
NIL NIL 

((PI P17 (((1.0 . 0.0) . (N.N))))) 
NIL ) . (NIL NIL NIL NIL NIL) ) 

To explain some of this: The first expression 
corresponds to figure A. It says figure A has 
been decomposed into three parts, which have 
been given the names PI, P2, and P3. Then we 
have a list of properties and relations and simi- 
larity information internal to figure A, namely, 
here, that P2 is inside P3, PI is above P2, and 
PI is above P3. The next two expressions give 
the corresponding information for figures B 

CktLXKX Vy# X 11^ XjLL ’COOiV/Al g X V \^0 AXA-LV/x X4.J.xvv4.vi.x 

about Euclidean similarities between figure A 
and figure B. For example, P3 goes into P5 
under a ‘scale factor = 1, rotation angle =: 0, 
and both reflections nulf transformation. The 
next two expressions contain the corresponding 
information between figure A and figure C and 
between figure B and figure C, respectively. The 
seventh list is a five-element list of lists of the 
parts of the five answer figures i the eighth a 
five-element list of lists, one for each answer 
figure, giving their property, relation, and simi- 
larity information. The ninth is again a five- 
element list, each a ‘similarity’ list from fig- 
ure C to one of the answer figures. The tenth, 
and last, expression is a dotted pair of expres- 
sions, the first again a five-element list, a ‘simi- 
larity’ list from figure A to each of the answer 
figures, the second the same from figure B to 
each of the answer figures. This brief descrip- 
tion leaves certain loose ends, but it should pro- 
vide a reasonably adequate notion of what is 
done by part I in processing our sample prob- 
lem. 

The ten expressions above are given as argu- 
ments to the top-level function of part II 


(optimistically called solve). The basic method 
employed by solve, which suffices to do this 
problem, begins by matching the parts of fig- 
ure A and those of figure B in all possible ways 
compatible with the similarity information. 
From this process, it concludes, in the case in 
question, that P2 ^ P4, P3 P5, and PI is 
removed in going from A to B. (The machinery 
provided can also handle far more complicated 
cases, in which alternate matchings are possible 
and parts are both added and removed.) On 
the basis of this matching, a statement of a 
rule taking figure A into figure B is generated. 
It looks like : 

( 

(REMOVE A1 ( (ABOVE A1 A3) (ABOVE 
A1 A2) (SIM OB3 A1 (((1.0 . 0.0) . 
(N.N)))))) 

(MATCH A2 (((INSIDE A2 A3) (ABOVE 
A1 A2) (SIM OB2 A2 (((1.0 . 0.0) . 
(N.N))))) . ((LEFT A2 A3) (SIM 
OB2 A2 (((1.0 . 0.0) . (N.N)) ((1.0 . 
3.14) . (N.N)))) (SIMTRAN (((1.0 . 
0.0) . (N.N)) ((1.0 . 3.14) . (N.N) 
)))))) 

(MATCH A3 (( (INSIDE A2 A3) (ABOVE 
A1 A3) (SIM OBI A3 (((1.0 . 0.0) . 
(N.N))))) . ((LEFT A2 A3) (SIM 
OBI A3 (((1.0 . 0.0) . (N.N)))) 
(SIMTRAN (((1.0 . 0.0) . (N.N) 
)))))) 

) 

The A’s are used as ‘variables’ representing 
objects. The format is rather simple. For each 
object added, removed, or matched, there is a 
list of the properties, relations and similarity 
information pertaining to it. (In the case of a 
matched object, there are two such lists, one 
pertaining to its occurrence in figure A and the 
other to its occurrence in figure B.) There are 
two special devices ; the (SIM OBI . . .) — form 
expressions give a means of comparing types of 
objects between, say, figure A and figure C ; the 
other device is the use of the SIMTRAN expres- 
sions in the figure-B list for each matched ob- 
ject. This enables us to handle conveniently 
some additional situations which we shall omit 
from consideration, for brevity. They are 
treated in detail in Ref. 1. 
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The above rule expresses everything about 
figures A and B and their relationship that is 
used in the rest of the process. (The reader 
may verify that the rule does, in some sense, 
describe the transformation of figure A into 
figure B of our example.) 

Next, a similarity matching is carried out be- 
tiveen figure C and each of the five answer fig- 
ures. Matchings which do not correspond to the 
ones between figure A and figure B in numbers 
of parts added, removed, and matched, are dis- 
carded. If all are rejected this method has 
failed and solve goes on to try a further method. 
In the present case, figures 1 and 5 are rejected 
on this basis. However, figures 2, 3, and 4 pass 
this test and are examined further, as follows. 
Choose an answer figure. For a given matching 
of figure C to the answer figure in question 
(and solve goes through all possible matchings 
compatible with similarity) we take each ‘fig- 
ure A figure B' rule and attempt to fit it to 
the new case, making all matchings between the 
A's of the rule statement and the objects of 
figure C and the answer figures which are com- 
patible with preserving add, remove, and match 
categories, then testing to see which informa- 
tion is preserved, thus getting a new, ‘general- 
ized' rule which fits both ‘figure A -> figure B' 
and ‘figure C the answer figure in question.' 
In our case, for each of the three possible an- 
swer figures we get two reduced rules in this 
way (since there are two possible pairings be- 
tween A and C, namely, PI P8, P2^ — >P6, 
and P3 P7, or PI <— > P8, P2 ^ P7, and 
P3 ^ P6). 

In some sense, each of these rules provides an 
answer. However, as pointed out earlier, we 
want a ‘best' or ‘strongest' rule, that is, the one 
that says the most or is the least alteration in 
the original ‘figure A figure B' rule and that 
still maps C onto exactly one answer figure. A 
simple device seems to approximate human 
opinion on this question rather well ; we define 
a rather crude ‘strength' function on the rules 
and sort them by this. If a rule is a clear winner 
in this test, the corresponding answer figure 
is chosen ; if the test results in a tie, the entire 
method has failed and solve goes on to try 
something else. In our case, when the values 
for the six rules are computed, the winner is one 


of the rules corresponding to figure 2, so the 
program, like all humans so far consulted, 
chooses it as the answer. The rule chosen looks 
like this : 

( 

(REMOVE A1 ((ABOVE A1 A3) (ABOVE 
A1 A2) (SIM OB3 A1 (((1.0 . 0.0) . 
(N.N)))))) 

(MATCH A2 (((INSIDE A2 A3) (ABOVE 
A1 A2) ) . ( (LEFT A2 A3) (SIMTRAN 
(((1.0 . 0.0) . (N.N)) ((1.0 . 3.14) . 
(N.N))))))) 

(MATCH A3 (((INSIDE A2 A3) (ABOVE 
A1 A3) ) . ( (LEFT A2 A3) (SIMTRAN 
(((1.0 . 0.0) . (N.N))))))) 

) 

Again, it is easy to check that this rule both 
takes figure A into figure B and figure C into 
figure 2, but not into any of the other answer 
figures. 

Further Examples and Comments 
(a) Examples 

We first exhibit several additional examples 
of problems given to ANALOGY : 

(i) (See Fig. 5) 
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Figure 5. 


Here the rule involves changes in the rela- 
tions of the three parts. ANALOGY chose an- 
swer figure 3. 

(ii) (See Fig. 6) 

This case involves both addition and removal 
of objects. ANALOGY chose answer figure 2. 

(iii) (See Fig. 7) 
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Figure 7. 

Note that this case required the more power- 
ful decomposition program. Here ANALOGY 
chose figure 3. 

(iv) (See Fig. 8) 

The rule here simply involved a rotation. 
ANALOGY chose figure 2. 

(v) (See Fig. 9) 

Here ANALOGY chose figure 3, using an 
extension of the part II techniques discussed 
above. This extension, employed after failure 
of the basic process, involves systematic substi- 
tution of certain specified relations (e.g., LEFT 
for ABOVE) for others in the part II input 
descriptions, thus making it possible for 
ANALOGY to relate the ‘vertical’ transforma- 
tion taking A into B to the ‘horizontal’ trans- 
formation of C into 3. 

(vi) In the problem of Fig. 1, the large circle 
of answer figure 4 was replaced by a large 
square and the problem rerun. Again figure 4 
was chosen but by a different rule. Now, in- 
stead of the inner object being removed, as be- 


rigure 9. 

fore, the outer object is removed and the inner 
one enlarged. This illustrates some of the flexi- 
bility of the procedure and the dependence of 
the answer choice on the range of allowed an- 
swers as well as on A, B, and C. 

(vii) (See Fig. 10) 

Here is an example of a failure by ANAL- 
OGY to agree with the human consensus which 
favors figure 5. ANALOGY chose figure 3. 
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Figure 10 . 
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(b) Comparison with Human Performance 

We can only roughly compare the perform- 

ance of ANALOGY with that of humans on 
geometric-analogy problems, since ANALOGY 
has not yet been given the complete set of such 
problems from any test for which scores are 
available. However, as some indication, we cite 
scores on the ACE tests based on a period of 
years including those editions of the test from 
which most of the problems on which ANAL- 
OGY was tested were selected. These scores are 
for a large population of college-preparatory 
students ; the median score, on a test consisting 
of* 30 such questions, ranged from 17 for 9th 
grade to 20 for 12th grade. Our estimate is that, 
on the same tests, ANALOGY, as it currently 
exists, could solve between 15 and 20 problems. 
Given, in addition, certain changes (mostly in 
part I, e.g., a more powerful decomposition 
program and additional properties and rela- 
tions) for which we have reasonably well- 
worked-out implementations in mind, ANAL- 
OGY should be capable of perhaps 25 successful 
solutions. 

(c) The Use of LISP 

The use of a list-processing language to con- 
struct the ANALOGY program appears to have 
been a suitable choice; most notably, its capa- 
bility at handling intermediate expressions of 
unpredictable size and ‘shape' (such as our 
figure descriptions and transformation rules) 
is of great value. We especially wish to praise 
LISP as a convenient and elegant language in 
which to write and debug complex programs. 
The ease of composition of routines, the highly 
mnemonic nature of the language, and the good 
tracing facilities all contribute greatly to effec- 
tive program construction. In return for the 
use of such a language one pays a certain price 
in speed and storage space, which, in the case of 
ANALOGY, at least, was a very acceptable 
bargain, since the necessity of machine-lan- 
guage coding would have made the entire proj- 
ect unfeasible. Incidentally, the ANALOGY 
program (apparently the largest program writ- 
ten in LISP to date) is so large that parts I 
and II must occupy core separately. The conse- 
quent limited (and one-way) communication 
between the parts was a serious design con- 
straint but proved to have some compensating 
advantages in simplicity. 


ANALOGY and Pattern-Recognition in 
Problem-Solving Programs 

In this section we shall consider certain 
aspects of the design of problem-solving ma- 
chines. To aid this discussion we shall specify 
(rather loosely) a subclass of problem-solving 
machines and carry out our discussion in terms 
of these though the ideas involved are by no 
means limited in applicability to this class. The 
machines we have in mind are typified by 
GPS*^ in that the problem to be solved by the 
machine is to transform one specified ‘object' or 
‘situation' (whatever this may mean in a par- 
ticular subject-matter context) into another by 
applying an appropriate sequence of transfor- 
mations chosen from a class available to the 
machine. A wide variety of problems may be 
cast in this form (again see Ref. 14 or other dis- 
cussions of GPS by the same authors). As in 
GPS, subgoals may be generated and attacked 
by such a machine and elaborate schemes of 
resource allocation may be required. However, 
these aspects do not concern us here. Our inter- 
est lies in the basic task of the machine ; given a 
pair of ‘objects,' it must choose an ‘appropriate' 
transformation, i.e., one contributing to the goal 
of transforming one of the given ‘objects' into 
the other. 

It is a widely-held view, with which we agree 
completely, that for a machine to be capable of 
highly intelligent behavior on a task of this 
kind, in a rich environment of objects and 
transformations (and, in particular, to be capa- 
ble of learning at a level more advanced than 
that of present machines), the critical factor is 
that it have a good internal representation of 
both its subject matter (‘objects') and its meth- 
ods (‘transformations'), as well as an elaborate 
set of ‘pattern-recognition' techniques for 
matching transformations to object pairs. 
Probably this means a quite ‘verbal' represen- 
tation of both objects and transformations as 
expressions in suitable ‘description languages.' 
Furthermore, these matching techniques must 
be represented in a form in which they them- 
selves are capable of being improved as the ma- 
chine gains experience. The central role which 
‘pattern-recognition' techniques must play in 
sophisticated problem-solving programs and the 
corresponding importance for effective learning 
of autonomous improvement in the perform- 
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ance of these techniques are well expressed in 
Minsky.*^ There we find : 

In order not to try all possibilities a re- 
sourceful program must classify problem situa- 
tions into categories associated with the do- 
mains of effectiveness of the machine's different 
methods. These pattern-recognition methods 
must extract the heuristically significant fea- 
tures of the objects in question. Again from 
Ref. 12 we have : 

Again from we have : 

In order to solve a new problem one uses 
what might be called the basic learning heuris- 
tic — first try using methods similar to those 
which have worked, in the past, on similar 
problems. 

Here, the problem is, of course, to have pat- 
tern-recognition techniques possessing, or able 
themselves to learn, criteria of ‘similarity' ap- 
propriate to the subject matter in question. 

The ‘fixed-length property-list' schemes (see 
Ref. 12) which characteristically have been used 
to perform this pattern-recognition task in cur- 
rent problem-solving programs have two prin- 
cipal defects which limit their extension to 
harder problems : 

(i) While, in principle, given enough suffi- 
ciently elaborate properties, one can make arbi- 
trarily fine discriminations, in practice a given 
set of properties will begin to fail rapidly as 
situations become more complex. In particular, 
for ‘situations' which must be treated as con- 
sisting of interrelated parts, the ‘global' nature 
of the scheme in question leaves it helpless. 

(ii) Such a scheme is very limited in its learn- 
ing capabilities, since it has access to very little 
information about its component properties ; in 
particular, it is incapable of “knowledgeably' 
modifying its tests or adding new ones — it can 
only modify the weightings given to the results 
of these tests in its ‘decisions.' 

In view of the limitations of the ‘property- 
list' pattern-recognition scheme just mentioned, 
we can formulate some requirements for a pat- 
tern-recognition scheme suitable tc replace it 
as a ‘transformation-selecting' mechanism. 
First, the scheme must have access to a repre- 
sentation of each ‘object' in terms of a ‘descrip- 
tive framework' for the subject matter in 


question which is suitable in that useful rela- 
tionships between ‘objects' can be extracted 
relatively simply ^from the corresponding rep- 
resentations. Furthermore, the transformation- 
selecting rules of the pattern-recognition appa- 
ratus should themselves be expressed in a 
representation suitable for a ‘learning mecha- 
nism' to revise the set of rules (i) by adding 
new rules and deleting those old ones which 
prove least useful as experience associates cer- 
tain object pairs with certain transformations 
and (ii) by replacing a set of particular rules 
by a ‘common generalization' rule again repre- 
sented in the same language. Such facilities 
could go far toward removing the limitations 
of which we have spoken and providing both a 
powerful rule language (the rules can be stated 
in terms of the ‘descriptive framework' we have 
postulated for the ‘objects') and a learning 
mode more sophisticated than any yet incorpo- 
rated in such a general problem-solving pro- 
gram. 

So far we have been enumerating desirable 
features in a ‘pattern-recognition' mechanism 
to be used as a transformation-selection device 
within a large problem-solver. What has all 
this to do with ANALOGY, which is not even a 
problem.-solving program of the class we have 
been considering? We suggest that ANALOGY 
can, under a suitable (rather drastic) reinter- 
pretation, be to some extent viewed as a pattern- 
recognition program having, to the limited de- 
gree appropriate for its particular environment, 
all the features we have listed. First, the 
‘objects' are the problem figures of ANALOGY 
and the suitable ‘descriptive framework' ap- 
propriate to these objects is the ‘subfigure and 
relation' representation used as the input part I 
generates for part II of ANALOGY. (Thus 
part I of ANALOGY corresponds to the appa- 
ratus that generates this representation for 
each object; that is, it goes from a representa- 
tion of the ‘problem objects' which is convenient 
for input to the problem-solver to one which is 
in a form suitable for internal use.) The gen- 
eration in ANALOGY of a transformation rule 
taking one answer figure into another can be 
thought of as corresponding to the first kind of 
learning we listed above, namely, the adding of 
rules as, with experience, the machine associates 
certain object pairs with certain simple or com- 
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posite transformations. Finally, the common 
generalization of two rules in ANALOGY cor- 
responds to the second kind of learning we 
mentioned, namely, the generation of a common 
generalization of several rules associating ‘ob- 
jects’ and ‘transformations.’ Furthermore, 
analogy’s process of choosing between ‘com- 
mon generalizations’ of different rule pairs mir- 
rors a process of selectively incorporating only 
those generalizations with the greatest dis- 
criminatory power. Under this interpretation, 
ANALOGY appears as a model for a pattern- 
recognition process with all the characteristics 
mentioned. The potential value of ANALOGY, 
viewed in this way, as a suggestive model for 
the construction of such pattern-recognition 
mechanisms for use within problem-solving pro- 
grams may prove to be the chief product of our 
work with ANALOGY and the best justification 
for having carried it out. 
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1. SIGNIFICANCE OF THE THEOREM- 
UTILIZING PROBLEM 

(1.1) Computers as Expert Problem Solvers 

There are a large number of difficult intellec- 
tual tasks which consist of synthesizing a par- 
tial ordering. To program a computer we must 
synthesize a sequence of computer instructions ; 
one way of constructing a logical derivation is 
by synthesizing a sequence of inference-rule 
applications which transform given premises 
into desired conclusion; one way of choosing 
among alternative moves in a game is by syn- 
thesizing and evaluating trees of possibly ensu- 
ing moves. Can we program a computer to 
perform such tasks? Can we program a com- 
puter to perform such tasks expertly, i.e., well 
enough that it can by itself surpass a skilled 
human being in the performance of such tasks ? 

A decade of work on the subject by Gelern- 
ter, Minsky, Newell, Samuel, Shaw, Simon, 
Slagle, Tonge, and others has given us a clear 
yes in answer to the first question. But the 
answer to the second is still very much in doubt. 
The comparisons so far have been with Navy 
boots, high school sophomores, and college 
freshmen. The question at issue concerns the 
difference between performing a sequence-syn- 
thesis task well and performing it poorly if at 
all. We would ask : What is skill in tasks like 
theorem proving and program writing? Sup- 
pose we have two machines each of which is able 
to prove at least some theorems but one rather 
like an unskilled student and other rather like 
a skilled mathematician. How do they differ? 


(1.2) Synthesizing Sequences of 
Symbol Transformations 

We must be a little more specific about the 
kind of task we are interested in. In logic, con- 
structing a derivation is constructing a se- 
quence of permissible symbol transformations 
by which given premises can be transformed 
into desired conclusions. The permissible sym- 
bol transformations are the basic rules of infer- 
ence. Thus in a system where the permissible 
transformations include detachment (i.e., trans- 
forming P and P Q into Q), interchange of 
conditionals and disjunctions (i.e., transforming 
P Q into P V Q and P v Q into ^ P Q), 
and commutation of disjunctions (i.e., trans- 
forming P V Q into Q V P), 2 in example of a 
derivation is the following : 

GIVEN PREMISES: 

(1) (Q^R) 

(2) P 

(3) ^R 

DESIRED CONCLUSION: -Q 

DERIVATION STEPS: 

(4) Q R Detachment transforma- 

tion of Premises (1) and 

( 2 ) 

(5) V R Interchange transforma- 

tion of Step (4) 

(6) R V Commutation transforma- 

tion of Step (5) 
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DERIVATION STEPS— Continued 

(7) Interchange transforma- 
tion of Step (6) 

(8) Detachment transforma- 
tion of Premise (3) and 
Step (7) 

Similarly in computing, constructing a pro- 
gram is constructing a sequence of available 
symbol transformations by which given input 
can be transformed into desired output. The 
available symbol transformations are the 
primitive computer instructions. Similar char- 
acterizations can be offered for game playing, 
puzzle solving, manipulating algebraic expres- 
sions, and other tasks. There are, of course, 
important differences among these tasks, espe- 
cially in the way the tasks are defined for the 
problem solver, but for the present our con- 
cern is with what they have in common. 

(1.3) Skill at Seque7ice Synthesis 

It would appear that one thing most se- 
quence-synthesis tasks have in common, at least 
insofar as human problem solvers are con- 
cerned, is a way in which skill at their perform- 
ance is acquired. Becoming a skilled logician 
or a skilled programmer mainly consists in 
learning useful combinations of the available 
primitive transformations and in developing 
a sensitivity to the occasions on which it is 
appropriate to apply these combinations. As 
we shall see, there is more to it than this, but 
this is a good place to start. 

Some examples will clarify our conception 
of skill. The easiest examples to come by are 
those where the skilled human being takes ex- 
plicit note of a useful combination of primi- 
tives. Thus the logician adds defined rules of 
inference to his system. Or he just simply adds 
a theorem to his list of established statements. 
For instance, very early in the development of 
a system for the propositional calculus the logi- 
cian will notice the usefulness of the transposi- 
tion theorem (Q R) ^ (r^R ^Q) and, 
after proving it with a sequence something like 
Steps (4) through (7) of our example above, 
will add it to his list of axioms and other proved 
theorems. Henceforth in developing a deriva- 
tion, whenever he needs to transform an expres- 
sion whose structure is like the antecedent of 


the theorem into an expression whose structure 
is like the consequent of the theorem, he merely 
cites the theorem. He does not indicate, as is 
done in the example above, all the primitive 
transformations, the primitive rules of infer 
ence, which are required actually to effect the 
transformation. Theorems are usually thought 
of as additions to the stock of assertions rather 
than as additions to the stock of transformation 
rules of a logical system. But we shall be pri- 
marily interested in them in their role as ex- 
plicit representations of defined, complex trans- 
formations, i.e., transformations which can be 
effected by combinations of primitives. 

In programming, the defined, complex trans- 
formations are the subroutines and macroes, 
some explicitly formulated, labelled, and stored 
on tape but many more, accumulated in the 
experience of writing many programs of many 
different kinds, “stored” more or less com- 
pletely only in the programmer’s head. 

(1.4) Construction Problems and Existence 
Problems 

An important distinction is to be noticed be- 
tween the kinds of sequence-synthesis prob- 
lems represented by programming problems on 
the one hand and by theorem-proving problems 
on the other. The programmer has to produce 
the actual sequence of primitive transforma- 
tions while the logician does not. This arises 
from a difference in the reasons why the two 
are interested in synthesizing sequences of 
primitive transformations. In logic the question 
of interest is whether such a sequence connect- 
ing premises and conclusion exists at all; if it 
can be shown to exist the logical problem is 
settled, and though the logician may be worried 
about problems of elegance and efficiency, these 
are extralogical worries. The programmer, on 
the other hand, must synthesize an actual se- 
quence of primitives. There is usually no ques- 
tion of whether such a sequence exists; the 
programming requirement is production of one 
to do some work. The programmer needs the 
actual sequence, not just the knowledge that one 
exists. 

We shall refer to the programmer’s problems 
as construction problems, opposing them to the 
logician’s existence problems. The distinction is 
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not a neat one because the logician usually uses 
methods such that from a demonstration of 
existence of a sequence of primitive inference 
rules he could automatically (and tediously) 
produce an actual sequence if called upon to do 
so. But the distinction is worth noting because 
it is construction problems with which we shall 
be primarily concerned in the present paper. 

(1.5) Related Research 

There has been a large amount of work in 
the recent past concerned with programming 
computers to prove theorems (e.g., Wang,^^ 
Gilmore J Davis and Putnam, ^ and Robinson^®) 
and with programming computers to program 
themselves (e.g., Kilburn et aL^ and Amarel).^ 
Though several of these authors, e.g., Wang 
and Amarel, respectively, have commented on 
the apparent importance of a mechanical 
theorem prover’s being able to save and utilize 
previously-proved theorems and of a self-pro- 
gramming machine's being able to save and 
utilize previously-developed subroutines, there 
has been little work directly on the problems 
raised. Programs with at least some ability to 
utilize previously-proved theorems (or their 
counterparts) are Newell, Shaw, and Simon's 
Logic Theorist, IS’ Gelernter's Geometry 
Theorem Prover,s.6 Slagle’s Automatic Integra- 
tor,22 and Simon's Heuristic Compiler.^i We 
would suggest that it is because it attempts to 
prove theorems by using previously-proved 
theorems that the Logic Theorist is of special 
importance even though it turns out to be a 
rather weak theorem prover in comparison 
with other systems which use alternative for- 
mulations and methods. Wang avers that 
Newell, Shaw, and Simon's approach to prov- 
ing theorems by machine is rather like trying 
to kill a chicken with a butcher knife. It is true 
that there are more efficient ways to dispatch a 
chicken, but there may be no better way to 
learn how to use a butcher knife. 

We turn, then, to reporting an investigation 
concerned not with theorem-proving or self- 
programming machines per se but with the 
more specific problem of how such sequence- 
synthesizing machines might be made efficiently 
to exploit previously-developed sequences when 
later given a problem of constructing a more 
complex sequence. Our approach has been actu- 


ally to write and run what can reasonably be 
called a theorem-utilizing program. We shall 
first describe the program. Then we shall re- 
port some results obtained with it and some 
conclusions to be drawn from these results. 
Finally, we shall indicate what appears to be a 
reasonable direction for future research on the 
problem. 

2. DESCRIPTION OF A THEOREM- 
UTILIZING PROGRAM 

(2.1) Criteria of Program Evaluation 

Our goal has been to construct an automatic 
theorem-proving system which proves theorems 
by efficiently utilizing previously-proved theo- 
rems. The criterion of success is realization of 
a theorem prover which becomes progressively 
better as it accumulates and otherwise modi- 
fies its store of previously-proved theorems. It 
should be remarked that improvement of the 
store is not merely a matter of accumulation 
and, as a matter of fact (as clearly demon- 
strated by Newell, Shaw, and Simon's^® work), 
under certain conditions simple increase in size 
of the store can hinder rather than help prob- 
lem-solving ability. This ability is more a 
matter of what particular previously-proved 
theorems are available and what information 
is stored about them and how this information 
is used, than it is a matter of how many are 
available. It is specifically to these points of 
how to select useful theorems for remember- 
ing, how to abstract them and their proofs, and 
how to use the abstracted information that the 
research here reported has been directed. 

(2.2) The Program's Problem Domain 

For our experiments with theorem proving 
we devised a special problem domain rather 
than selecting some standard domain such as 
the propositional calculus, group theory, or 
plane geometry. We shall presently give our 
reasons for doing this, but first let us describe 
the domain. It will probably be simpler if we 
first present problems in the domain as if they 
were problems of programming a very simple 
computer best characterized as a vector adder. 
The computer has no data memory. It is given 
as input an ordered n-tuple of numbers and 
operates on this n-tuple according to a given 
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program. The program is a simple sequence 
with each of its constituent basic instructions 
operating directly on the output of the immedi- 
ately preceding instruction in the sequence. 
Each of the basic instructions adds a quantity 
(possibly zero or negative) to each term of the 
n-tuple on which it operates. Thus each of 
these instructions can also be represented as an 
ordered n-tuple of numbers. 

Consider an example of a 4-tuple adder which 
has as its basic instructions (S) <0,1,2, 3>, 
(T) <3,2,1,0>, and (W) <0, — 1,1, 0>. Atypi- 
cal programming problem might be that of syn- 
thesizing a sequence of these instructions which 
will transform <4,9, 2, 7> into <10,10,10, 10>. 
As can easily be verified, one program which 
will do this is (S), (T), (T), (W), (W), (W), 
(W). The first instruction (S) transforms 
<4,9,2,7> into <4,10,4,10>, the second in- 
struction (T) transforms this into <7,12,5, 10>, 
etc., until operation of the final instruction (W) 
gives an output of <10,10,10,10>. 

Problems of the kind we are considering can 
just as well be interpreted as theorem-proving 
problems in what might be called a tally cal- 
culus. Thus, rather than the n-tuple <4,9,2,7> 
we might have the expression 

AAAA/BBBBBBBBB/CC/DDDDDDD 
and we might ask whether 

AAAAAAAAAA/BBBBBBBBBB/ 

CCCCCCCCCC/DDDDDDDDDD 

can be derived from it given basic rules of in- 
ference corresponding to (S), (T), and (W). 
These rules are to be interpreted in such a way 
that (W), for instance, means: make no change 
in the A’s, delete one B, add one C, and make 
no change in the Z)'s. 

In such a tally calculus there might well be 
existence problems as well as construction prob- 
lems. Most of the calculi with which we have 
actually worked, however, have been supplied 
with basic rules of inference sufficient to make 
possible the derivation of any expression in 
the calculus from any other expression. Our 
reason for choosing such calculi has been that 
we are interested in problem-solving mecha- 
nisms and not in whether some particular deri- 
vations are possible, and we learn little from 


applying these mechanisms to problems which 
have no solution. It will be a very long time 
before we can hope to have non-algorithmic 
problem-solving mechanisms powerful enough 
that their failure to discover a solution is pre- 
sumptive evidence that no solution exists. 

We have imposed on our problem solver re- 
quirements additional to those of simply con- 
structing a sequence of transformations which 
will transform a given input n-tuple into a de- 
sired output n-tuple: (1) There are lower and 
upper bounds on the numbers intervening be- 
tween input and output ; thus we might require 
that no intervening number be less than 0 or 
greater than 19. (2) Even within these bounds, 
there can be certain proscriptions; thus we 
might proscribe any n-tuple containing the 
number 8 from intervening between start and 
goal. 

A comment is in order concerning our rea- 
sons for choosing such an apparently artificial 
problem domain. Our intention was to design 
a vehicle with which we could conveniently 
investigate utilization of theorems or sub- 
routines, one simple enough that most of the 
program design effort could be concentrated 
directly on such utilization rather than on mat- 
ters of other kinds, e.g., pattern recognition. 
As we shall see, in this we were only partially 
successful. Further, we wanted problems pos- 
sessing some of the features of programming or 
theorem-proving problems but not so many of 
these features that an extremely complex pro- 
gram would be required to solve any problems 
at all. Finally, we wanted a problem domain 
which did not present difficult problems for us. 
This allows us to take an Olympian view of 
what our mechanical problem solver is doing. 

Let us see how our problems of programming 
a vector adder compare with programming or 
theorem-proving problems which people actu- 
ally have to solve. In our case, there are no dif- 
ficulties associated with merging two lines of 
processing into one. Formally this is because 
the constituent transformations (the basic in- 
structions or rules of inference) are all single- 
argument functions, a situation which holds 
neither in real-life programming nor in real- 
life theorem proving. Further, there are no 
difficulties in our case associated with branch- 
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ing or iteration or recursion. Thus our prob- 
lems are far simpler than actual programming 
or theorem-proving problems, but they have 
enough of the features of these actual prob- 
lems to be interesting. In particular, our prob- 
lems involve discovering a combination of basic 
transformations which will effect some particu- 
lar complex transformation and, further, of dis- 
covering an acceptable ordering of these basic 
transformations. The requirement that they be 
ordered in a particular way derives from our 
imposing bounds and proscriptions on an ac- 
ceptable solution. Thus, in the example given 
above, if we were to stipulate that no ^-tuple 
containing a number larger than 10 might in- 
tervene between start and goal, the suggested 
sequence (S), (T), (T), (W), (W), (W), (W) 
would no longer be a solution but the identically 
constituted though differently ordered sequence 
(W), (W), (W), (W), (S), (T), (T) would be. 
These various aspects of our problem domain 
should become clearer with the examples to 
follow. 

(2.3) Spatial Interpretation of the Problem 
Domain 

An Olympian overview is facilitated by giv- 
ing the n-tuples a spatial interpretation. Thus 
in the case of pairs (^-tuples), we can correlate 
the start and goal pairs, the. intervening pairs, 
and the proscribed pairs with cells in a grid. 
In Figure 1, for instance, we correlate a start 
pair <4,10 > with the cell marked “S,"’ a goal 
pair <4,14> with the cell marked ‘‘G,’' and 
the proscribed pairs with the blacked-out cells. 
Given the pairs <2,1>, <2, — 1>, < — 1,2>, 
and < — 1, — 2> as the basic transformations 
available, the problem can be interpreted as 
that of working out a path (like the one in the 
figure) with these basic transformations as 
constituent steps, with no step ending in a 
blacked-out cell, and which begins at S and ends 
at G. 

By supplementing the set of basic transfor- 
mations of the example so that we have one 
transformation for each of the eight legal 
knight’s moves in chess, we have the problem 
of moving a knight around on a chessboard. 
We have christened the system defined by the 
four listed transformations the half-knight cal- 
culus (HKC) and that defined by the full set 
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Figure 1. Spatial Interpretation of a ^-Tuple 
Transformation Problem. 

The problem consists in constructing a path, like the 
one indicated, from S to G. The path must be con- 
stituted only of allowed steps and none of these steps 
can end in a blacked-out cell or out of bounds. 
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of eight the full-knight calculus (FKC). In 
order to avaiFourselves of the two-dimensional 
spatial interpretation, most of our work has 
been with ^-tuple systems like these. (Both 
the HKC and FKC are, by the way, complete 
in the sense that their basic transformations 
are sufficient to transform any pair into any 
other pair.) 

The powers of human visual perception and 
spatial intuition are very useful in analyzing 
our mechanical problem solver’s performance, 
in comparing and evaluating its solutions and 
attempted solutions, and in deciding on proce- 
dures which might be useful to it. There is some 
danger in using this spatial interpretation, 
however, for one is likely to be misled concern- 
ing the nature and difficulty of the problems 
insofar as the computer is concerned. We would 
specifically warn the reader against a tendency 
to look upon the problems as problems of run- 
ning a maze with the connectivity of the nodes 
corresponding to the directly perceived con- 
nectivity of cells in the grid. And we would 
suggest that he can grasp the difficulty of solv- 
ing the problems involved without any aid 
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from visual perception and spatial intuition, as 
the computer must solve them, if he thinks of 
what his own situation would be in the five- 
or six-dimensional case. 

(2.4) A Succession of Problems from Easy to 
Hard 

Stipulation of lower and upper bounds and 
of proscribed n-tuples defines a problem field. 
For our experiments with ;^-tuples we defined 
five fields, each with a lower bound of 0 and an 
upper bound of 19 but with the differing sets 
of proscribed pairs represented by the blacked- 
out cells in Figures 2 through 6. It should be 
understood, again with respect to possibly mis- 
leading aspects of the spatial interpretation, 
that the research has not been concerned with 
designing a system which would learn local 
properties of the five fields. Our reasons for 
restricting the number of fields rather than 
using a different one for each different problem 
was solely one of convenience. Since only a part 
of each field is relevant to a particular prob- 
lem, the five fields serve to determine a variety 
of ordering constraints sufficient for our pres- 
ent purposes. 

Given the five fields, a succession of 60 pro- 
gressively more difficult problems was deter- 
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Figure 3. Second Problem Field. 

The problem illustrated is Problem 24 : within the 
given field of proscribed pairs, transform <18, 6> into 
< 11 , 12 >. 



Figure 4. Third Problem Field. 

The problem illustrated is Problem 37 : within the 
given field of proscribed pairs, transform <8, 9> into 
<18, 19>. 


Figure 2. First Problem Field. 

The problem illustrated is Problem 6 : within the 
given field of proscribed pairs, transform <17, 10> into 
<17, 8>. 


mined by using each of the fields 12 times and 
for each use defining the numbers for the start 
pair and the goal pair with a semi-random 
process controlled so that the difference be- 
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Figure 5. Fourth Problem Field. 

The problem illustrated is Problem 48: within the 
given field of proscribed pairs, transform <3, 8> into 


<11, 18>. 



Figure 6. Fifth Problem Field. 

The problem illustrated is Problem 55 : within the 
given field of proscribed pairs, transform <9, 15> into 
<18, 1>. 

tween the numbers was likely to be greater for 
the later problems in the succession. Figures 2 
through 6 represent Problems 6, 24, 37, 48, 
and 55, respectively. 


(2.5) Storage of Theorems 

We can now briefly describe the program 
with which we are investigating theorem uti- 
lization and which so far has been used to test 
some ideas concerning how utilization of solu- 
tions of the easy problems early in the succes- 
sion of 60 problems might facilitate solution of 
the more difficult problems later in the succes- 
sion. The most important idea was that once 
a problem was solved the solution should be 
generalized and precisely categorized in a cer- 
tain way, with the solution’s possible relevance 
to the solution of later, more difficult problems 
being determined by its categorization. We 
wished to devise something more efficient than 
mere addition of new theorems to an unordered 
theorem list which then had to be scanned in 
its entirety each time a search was made for 
possibly useful theorems, as was the case with 
Newell, Shaw, and Simon’s original Logic 
Theorist (with the exception of one experiment 
where the theorems were categorized and 
weighted on the basis of their experienced use- 
fulness in connection with particular methods). 

One can use various techniques to make the 
selection of theorems possibly useful for a par- 
ticular problem more efficient than a brute- 
force search of an unordered list. For instance, 
Newell, Shaw, and Simon used certain similar- 
ity tests in the search of the Logic Theorist’s 
theorem list; they experimented with enhanc- 
ing search efficiency by pre-computing the in- 
formation for each theorem needed by these 
tests and storing it along with the theorem, 
thus obviating its computation each time the 
list was searched. Although the indexing was 
inverted, this represents a simple case of the 
kind of theorem categorization which we have 
wished to implement and test. Stefferud-’^ has 
modified the Logic Theorist so that previously- 
proved theorems are stored not on a simple list 
but as part of a complex list structure (called 
a theorem map) which compactly represents 
the syntactical structures of all the theorems 
available. Theorems are selected for possible 
usefulness in connection with a given problem- 
atic expression by a single operation matching 
the syntactical structure of the expression 
against this theorem map. That part of the 
map with which an appropriate match is ob- 
tained determines the whole set of previously- 
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proved theorems which have the appropriate 
syntactical structure for use with the given 
expression. Our approach is not unrelated to 
Stefferud's but we would like to work out a 
theorem-selection system which uses clues to 
the usefulness of a previously-proved theorem 
more powerful and more selective than whether 
or not there is a precise match between the 
syntactical structure of part of the theorem and 
the syntactical structure of part of the expres- 
sion to be proved. What such clues might be 
in the particular case of a propositional calculus 
like that with which Steflferud is working we 
leave for consideration in another place; here 
let us see what we mean for systems like the 
half-knight and full-knight calculi. 

It will be recalled that the problem of con- 
structing a derivation in one of these calculi 
consists in discovering what combination of 
basic transformations will transform a start 
pair into a goal pair and in ordering these 
transformations so that none of the interven- 
ing pairs are out of bounds or on the list of 
proscriptions. For the experiments here re- 
ported what was saved of any particular suc- 
cessful derivation was information about how 
many applications of each basic transformation 
were required plus the defined transformation 
which this combination was able to effect. This 
was abstracted from the order in which the 
basic transformations were needed and from 
the particular start pair and goal pair which 
had defined the problem. Thus, as illustrated 
in Figure 7, the sequence < — i, — 

< — 1 , 2 ':> might be used to solve the problem 
of getting from <1,2> to <1,3>. What will 
be saved about this solution is that it uses one 
each of the indicated basic transformations and 
that this combination will effect the defined 
transformation <0,1 >, i.e., it will transform 
any pair <X,Y> into <X,Y+1>. Now if the 
system is later confronted with the problem, 
perhaps as a subproblem of some larger prob- 
lem, of transforming <11,8> into <11,9> in 
the problem field also as illustrated in Figure 7, 
it might retrieve and try to use the information 
it has saved from the earlier solutions ; but after 
it has retrieved the information it has the ad- 
ditional task of ordering the basic transforma- 
tions in a manner which satisfies the con- 
straints of the new problem. Because of the 
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Figure 7. Use of a Previously Developed Solution 
to Solve a New Problem. 

A combination of allowed steps which has been dis- 
covered to solve the problem <1, 2> to <1, 3> can 
later be used to solve <11, 8> to <11, 9>. But a differ- 
ent ordering of the steps will be required for an ac- 
ceptable solution. 

role they serve as defined transformations, let 
us call the abstracted solutions “theorems’’ 
even though they may not look much like what 
the logician usually refers to w’ith this term. 

The categories on the basis of which theo- 
rems are filed and retrieved must of course be 
relevant to the information which is saved 
about the theorems. Four easily computable 
binary properties have been used to determine 
these categories. For each theorem let us call 
the change it makes in the first term of a pair 
on which it operates its X-value and the change 
it makes on the second term its Y-value. The 
four binary properties used to categorize a 
theorem are whether its X-value is positive, 
whether its Y-value is positive, whether its 
absolute X-value is greater than or equal to its 
absolute Y-value, and whether its absolute 
Y-value is greater than or equal to its absolute 
X-value. Tests to determine the values of these 
properties for a particular theorem are ar- 
ranged on a simple discrimination tree (which 
can easily be expanded or otherwise modified 
for experimentation with alternative proper- 
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ties) . When filed, a theorem is sorted down this 
three into one of eight categories correspond- 
ing to octants in a Cartesian coordinate sys- 
tem. The theorems in one category are put on 
an ordered list with the position of a theorem 
on the list being determined by the sum of its 
absolute X-value and its absolute Y-value. The 
basic transformations of a calculus are placed 
on these lists right along with the defined 
transformations. It will be noticed that the 
FKC has one basic transformation for each of 
the eight lists. 

(2.6) Retrieval of Theorems 

For our problem domain, problems can be 
characterized with exactly the same properties 
used to characterize theorems. Thus the pri- 
mary technique used to select a theorem possi- 
bly relevant for solution of a problem is to sort 
the problem down the same discrimination tree 
used to separate and file theorems. Given the 
relevant list of theorems, the first selection is 
that one which leaves the smallest remaining 
problem. (More precisely: given a problem of 
transforming S into G, first selected is that 
theorem T for which the sum of the absolute 
X-value and absolute Y-value of the problem 
T(S)-to-G is smallest.) Lower priority selec- 
tions are made by alternately taking theorems 
from above and below the first choice on the 
theorem list being used or, if the list is too 
short to provide enough selections as deter- 
mined by an externally specified parameter, by 
making selections from the lists for neighbor- 
ing categories. 

We should make two clarifying remarks be- 
fore proceeding with program description. 
There is a difficult question when specifying a 
system such as the one being described concern- 
ing the criteria which should be used in select- 
ing a theorem to be tried. About the only thing 
which recommends the size of T(S)-to-G as a 
primary criterion in our case is that it is easily 
and quickly computable. A more sophisticated 
system would attempt an evaluation of the dif- 
ficulty of actually being able to realize T(S) 
with T given the ordering constraints ; and an 
evaluation, with measures better than size 
alone, of the problem T(S)-to-G. Gelernter’s 
oft-cited use of a semantic model was directed 


toward evaluating the counterpart of T(S)- 
to-G for his Geometry Theorem Prover. (Work- 
ing backwards, what his system actually had 
to evaluate was S-to-T-i(G). It did this by 
testing whether T-^(G) was true in a model 
determined by S, the given premises.) 

A second clarifying remark concerns sorting 
problems down the same discrimination tree 
used to file theorems. In general, there might 
be properties which would be useful for cate- 
gorizing theorems but which would be inappli- 
cable to problems or vice versa. In our present 
case, for instance, the total number of basic 
transformations involved in their definition 
might be a property useful for categorizing 
theorems. Problems have no such property. 
There would be, however, a corresponding prob- 
lem property whose value could be tested at the 
same point in the sorting tree. There would 
have to be such a problem property else the 
theorem property would not be a useful basis 
of categorization. It should not be forgotten 
that the categorization is for purposes of deter- 
mining relevancy of a theorem to a problem. 

To continue with program description: The 
first thing the program does when attempting 
to solve problem.s is to construct a tree of pos- 
sibly useful theorems. The number of branches 
from each node is determined by an externally 
specified parameter N. The program selects N 
theorems possibly useful for the problem 
S-to-G; then it recursively selects N theorems 
for the problem T(S)-to-G determined by the 
best T of its first N selections ; then it recursively 
selects N theorems for the problem U(T(S))- 
to-G determined by the best U of its second 
N selections ; etc. Recursion is terminated 
either when a closure is effected, i.e., when a 
sequence of selected theorems together trans- 
form S into G (ignoring the ordering con- 
straints) ; or when a maximum allowable depth 
of recursion, as determined by an externally 
specified parameter, is reached. At the point 
when recursion is first terminated, the tree will 
have the structure indicated in Figure 8. The 
non-maximum-depth terminal nodes will be ex- 
tended to full depth only if necessary, i.e., if 
the theorem choices from the completed part 
of the tree do not result in a solution. 
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Third N selections 


N = 3 (fof problem U(T{5))-fo-G) 



Figure 8. Tree of Theorem Selections. 

The tree is extended in the manner indicated until 
closure is effected or until a recursion limit is reached. 
If the top line of theorems (T, U. V, etc.) does not 
result in a solution, other lines (T, U, V', etc.) will be 
tried. The tree will be extended from the lower nodes, 
e.g., that for T', whsn but only when necessary. 

(2.7) Application of Retrieved Theorems 

Given the tree of selected theorems, the pro- 
gram now picks up from the top down (as the 
tree is represented in Figure 8) sequences of 
theorems which effect a closure, trying one 
after another until it attains a solution, runs 
out of allowed time, or exhausts the tree. Given 
a closure-effecting sequence of theorems, the 
program still has a difficult task to accomplish 
for it must order the constituent basic trans- 
formations so that the path of pairs interven- 
ing between start and goal is in bounds and 
does not include any proscribed pairs. The 
ordering mechanisms constitute the longest and 
most complex part of the program. (The entire 
program is approximately 4500 IPL instruc- 
tions in length.) However, we shall not here 
describe these mechanisms because their opera- 
tion is not directly relevant to our present 
theme of theorem utilization (though, as we 
shall see, their complexity and the fact that 
they don’t work very well are directly relevant 
to conclusions to be drawn concerning effective 
use of theorems by machine). One of these 


mechanisms uses a recursive technique of 
breaking problems into subproblems, and we 
shall say something about this particular, very 
important mechanism in the next section. 

(2.8) Breaking Problems into Subproblems 

A number of authors, especially Minsky, 
have remarked on the importance of enabling 
problem-solving programs to transform a large 
problem into a constellation of smaller prob- 
lems, “to plan” as Minsky calls it. This has 
turned out, however, to be a very difficult capa- 
bility to implement, and there are not many 
(if any) operating programs which have it, 
though Newell, Shaw, and Simon have done 
some work toward incorporating a planning 
capability in their General Problem Solver. We 
suggest that theorems (or, more generally, what 
we have called abstracts of previously devel- 
oped sequences) might well be primary instru- 
ments of planning. When planning, a human 
sequence synthesizer usually is especially inter- 
ested in discovering a special kind of subprob- 
lem, viz., one very like problems he has already 
solved. A good deal of human sequence-synthe- 
sizing activity is an attempt to transform a 
problem into a constellation of subproblems 
each of which belongs to a familiar kind of 
problem, familiar in the specific sense of the 
problem solver’s having available abstracts of 
sequences previously developed as solutions for 
problems of just this kind. The kinds of fa- 
miliar problems will be more or less general 
and the abstracts of previous solutions more 
or less complete, and that’s why, even after 
retrieval of the abstracts, the familiar prob- 
lems are still problems. So in our case, the re- 
trieved theorems often won’t work. We shall 
presently want to examine why, but here we 
simply want to point out the connection between 
planning and theorem utilization. 

There are besides theorem utilization other 
kinds of planning, many of which appear to be 
special to particular problem domains. One 
such special kind is used by our present pro- 
gram when faced with the task of ordering a 
set of basic steps so that none of them result 
in a proscribed pair of numbers. When start- 
ing on any problem, the program has no way 
of determining which if any of the given pro- 
scriptions in the problem field are relevant to 
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the problem. The reader can grasp the nature 
of this difficulty by imagining himself given the 
proscriptions simply as a list of pairs of num- 
bers rather than in their spatially interpreted 
form. He then has no easy way of telling which 
of these pairs might get in the way as he at- 
tempts to work out a path between one pair of 
numbers S and another pair G. The program 
proceeds, to begin with, as if none of the pro- 
scriptions were relevant. As it tries one path 
after another it compiles a list of those pro- 
scribed pairs which have prevented success. 
This list's reaching a certain size, as deter- 
mined by an externally specified parameter, is 
taken to indicate that success is unlikely with- 
out taking into consideration the specific con- 
figuration of proscriptions which are relevant. 

Taking its occurrence on the list as evidence 
that a particular proscription is relevant, the 
program determines rows and columns (speak- 
ing in terms of the spatial interpretation) of 
relevant proscriptions and then looks for gaps 
in these rows and columns through which a 
successful path is apparently going to have to 
proceed. Cells within these gaps are then 
selected as midpoints defining possible sub- 
problems, each such cell M determining the sub- 
problems S-to-M and M-to-G. Several such 
M's might be selected at one time, and they 
are put on an ordered list with the position of 
an M on the list being determined by criteria 
like the number of proscriptions in its immedi- 
ate vicinity. The order of this list determines 
the order in which attempts are made to solve 
the subproblems determined by the M's. 

Given a pair of subproblems S-to-M and 
M-to-G on which to work, the system attacks 
these recursively, approaching each with the 
same mechanisms it originally applied to the 
problem S-to-G. Thus S-to-M might be broken 
into S-to-M' and M'-to-M, and so on. The only 
difference between operations of the program 
at one subproblem level and at another is that 
the lower the level the less the effort allowed 
per subproblem before reporting failure. The 
amount of effort expendable is controlled by 
reducing the potential size of the tree of 
theorem selections at each recursion onto a sub- 
problem, as illustrated in Figure 9. This reduc- 
tion also provides an effective means of ensur- 
ing that recursion will terminate. 



2)At Subproblam L«v«l 1 


3)At Subproblam L«v«l 2 

Figure 9. Theorem Selection Trees at Different 
Subproblem Levels. 

The amount of effort spent on subproblems is controlled 
by reducing the potential number of theorem combina- 
tions which can be tried each time descent is made to 
a lower level. 
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Figure 10. Recursive Solution of Subproblems of 
Problem 37. 

The original problem S-to-G was solved by breaking it 
into the subproblems S-to-M and M-to-G and recur- 
sively attacking these. The problem M-to-G was further 
broken into M-to-M' and M'-to-G before success was 
achieved. 
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A solution obtained by the method of deter- 
mining subproblems and recursively attacking 
them is illustrated in Figure 10. The original 
problem <8,9> to <18,19> was broken into 
<8,9> to <1,12> and <1,12> to <18,19>, 
and the latter of these in turn into <1,12> to 
<1,14> and <1,14> to <18,19> before suc- 
cess was attained. This simple example does 
not illustrate the frequent occurrence of a num- 
ber of alternative midpoints being tried and 
discarded before usable ones are hit upon. 

(2.9) Deciding Which Theorems to Save 

We earlier raised the problem of improving 
a theorem file in ways other than simply adding 
to it. There is built into the program a mecha- 
nism for periodically culling from the file theo- 
rems which have been of little use. We shall 
not report more fully on the mechanism in this 
paper, however, because our experiments have 
not yet provided us with the opportunity to 
test the mechanism and to compare it with 
alternatives. 

3. RESULTS OF EXPERIMENTS 

(3.1) Vindication of Theorem Utilization 

Sometimes the designer’s product doesn’t 
work at all; the airplane crashes or the ship 
sinks. So the first questions to ask about a 
complex problem-solving program are “Does 
the program work when all the pieces are put 
together? Does it solve problems?” We are 
happy to report in the present case that the 
answer is yes. The program does solve prob- 
lems, in some cases quite elegantly (Figure 11 
shows one machine solution of Problem 56 in 
the FKC), in other cases very inelegantly (Fig- 
ure 12 shows another machine solution of the 
same problem) , but of course in some cases not 
all (Figure 13 shows a good but unsuccessful 
try for Problem 38 in FKC). 

More to the specific point of the experiments, 
the problem-solving power of the system was 
substantially increased by its ability to utilize 
theorems. We tested this by comparing its per- 
formance on Problems 51 through 60 without 
any transformations in its file except the basic 
ones of the FKC and its performance on these 
same ten problems with its file containing the 
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Figure 11. A Good Machine Solution to Problem 56. 
The program is able to solve problems — in some few 
cas3s, elegantly. (The illustrated solution is in the 
FKC: allowed steps corresponding to the eight legal 
knight's moves in chess.) 

basic transformations of the FKC plus all the 
defined transformations (about 60) which it 
had accumulated from solving those which it 
could of Problems 1 through 50 and associated 
subproblems. Almost all of the time of the theo- 
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Figure 12. A Poor Machine Solution to Problem 56. 
The program is able to solve problems — but, in many 
cases, very inelegantly. (The illustrated solution is in 
the FKC.) 
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Figure 13. A Good But Unsuccessful Machine Try. 
The program in any of its versions was unable to solve 
Problem 38. To get an idea of the kind of ordering 
constraints the program is up against, the reader might 
try moving a knight from S to G without letting it 
touch down on any of the blacked-out cells. 

remless machine was spent in searching for 
combinations of the basic transformations 
which together would transform start pair into 
goal pair. Given its difficulty in discovering 
any combination at all which would effect this 
closure, it was unable to solve any but the sim- 
plest of the ten problems. Successful solution 
of the more difficult problems required having 
a variety of these combinations easily avail- 
able from which to discover one which could 
be ordered to satisfy the ordering constraints. 
The machine with theorems was quickly able 
to discover combinations of basic transforma- 
tions which effected a closure and could spend 
its time attempting to arrange these combina- 
tions in an acceptable order, quickly discover- 
ing an alternative combination in case further 
work on some given combination appeared 
futile. 

Examination of the relative performance of 
the two machines (the theoremless machine 
and the machine with theorems) on Problem 60 
is instructive. Figure 14 illustrates the solu- 
tion which the machine with theorems was able 
to achieve in 48.5 minutes (and in 1,276,809 
IPL cycles). This solution involved a total of 
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Figure 14. A Hard-Won Machine Solution of 
Problem 60. 

Even with the use of previously-provided theorems, the 
achievement of this solution required a great deal of 
effort (over a million IPL cycles). Without theorem 
utilization the program was unable to solve the problem 
at all. (The illustrated solution is in the FKC.) 

13 different closures. In contrast, the theorem- 
less machine in 60 minutes effected only two 
closures, with combinations which it was un- 
able to order satisfactorily, and fruitlessly 
spent the rest of the hour allowed it trying to 
effect a third. 

In at least one important sense the HKC and 
FKC derivations required in the present experi- 
ments are not trivial; this is with respect to 
the very large number of basic transformations 
required to effect them. For the problem illus- 
trated in Figure 15, for instance, 13ie solution 
achieved by our program (witii thecffems) is 
44 steps long; an apparently optimum solution 
(achieved by a human being) is 28 steps long. 
A question raised by these performanees is 
whether there are any general strategies 
(heuristics), other than utilization of previ- 
ously developed subsequences, powerful enough 
to enable ssmthesis of very long sequences in 
non-trivial cases like theorem proving and pro- 
gramming. It might be suggested that all the 
machine needs is something like the space- 
perception ability which enabled the human 
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Figure 15. Machine and Human Syntheses of 
Long Sequences. 

The machine-produced sequence (heavy lines) is 44 
steps long, the human-produced sequence 28 steps long. 
The human’s visual abilities enabled him to get along 
without much prior experience on this kind of task. But 
without such abilities the machine needed prior experi- 
ence to synthesize a sequence so long. (The illustrated 
solutions are in the FKC.) 

being to synthesize his 28-step sequence without 
much prior experience on this kind of task. But 
this is just to shift the problem to another 
domain. Given the spatial interpretation of 
the problem, the human being makes extensive 
use of experience, if not ontogenetic then 
phylogenetic. 

So the question still stands. Certainly no 
mechanical game player which looks ahead 
move by move is able to come close to looking 
28 (or 44) moves ahead. Samuel’s checker 
player^® might achieve such farsightedness but 
it is of interest to note that this is with a tech- 
nique which is a kind of utilization of pre- 
viously-developed (trees of) sequences. When- 
ever the checker player, using what Samuel 
calls its rote memory, applies to positions X 
moves ahead a previously-computed evaluation 
which involved looking Y moves ahead, this 
provides it with an evaluation identical with 
what it would achieve by directly synthesizing 
and evaluating sequences X-|-Y moves in 
length. The effect is cumulative so that even- 


tually the checker player is able to look ahead 
a very great distance. Samuel’s work, by the 
way, leads to an important conclusion relative 
to what we have called the theorem-utilization 
problem : however powerful theorem utilization 
might be, attainable degree of theorem gener- 
alization and attainable efficiency of theorem 
retrieval impose limits. Samuel was able to use 
retrieval techniques as efficient as binary 
search but was unable to achieve any signifi- 
cant degree of generalization ; as a consequence 
his file of experience would reach an unmanage- 
able size before it enabled his machine to play 
good checkers at any stage of the game. He had 
to turn to techniques other than “theorem uti- 
lization” in order to achieve good mid-game 
playing ability. It should be noted, however, 
that these other techniques were not alterna- 
tive ways of synthesizing longer sequences but 
were improved ways of evaluating given 
sequences of limited length. Hence, our ques- 
tion still stands: are there any strategies, 
other than utilization of previously-developed 
sequences, powerful enough to enable synthesiz- 
ing of very long sequences in difficult problem 
domains if this is what we have to do, say to 
program a computer? 

We might apply the question directly to 
Newell, Shaw, and Simon’s General Problem 
Solver (GPS).^'^ The primitive operators used 
by GPS correspond to what we have called basic 
transformations of a sequence-ssmthesizing 
problem domain. Given this correspondence, 
the many similarities between the operation of 
our program in its theoremless mode and the 
operation of GPS are obvious. Of course, add- 
ing to GPS an ability to accumulate and effi- 
ciently to use defined operators may not prove 
especially difficult, though it raises questions 
of exactly the kind with which we are con- 
cerned in this paper. 

To complete our case for the indispensability 
of theorem utilization for difficult sequence syn- 
thesis, we might temporarily look up from our 
experiments and appeal to authority. After 
making the point that non-trivial logical deduc- 
tions usually require many steps of inference, 
Carnap^ (pp. 36-37) remarks, “In practice a 
deduction in science is usually made by a few 
jumps instead of many steps. It would, of 
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course, be practically impossible to give each 
deduction which occurs the form of a complete 
derivation in the logical calculus, i.e., to dis- 
solve it into single steps of such a kind that 
each step is the application of one of the rules 
of transformation of the calculus. . . . An ordi- 
nary reasoning of a few seconds would then 
take days.” We suggest that, at least in this 
case, as it goes for men, so it goes for machines. 

(3.2) Failure to Satisfy Goal Criterion 

We must now turn to reporting and analyz- 
ing less positive results of our experiments. 
Over-all, our program was not a very good 
problem solver. For the HKC, in two runs (dif- 
fering in respects inessential to our present in- 
terest) through the entire succession of 60 
problems, the program was unable to solve 33 
in one case and 30 in the other. For the FKC, 
in two runs the numbers were 21 and 17. Of 
course, increasing the time allowed per prob- 
lem would have improved this performance 
somewhat, but we experimented enough with 
this variable safely to conclude that the time 
limit would have had to be extended unreason- 
ably to assure success with most of the un- 
solved problems and, in any case, time taken 
to achieve solution is an important measure of 
the power of a heuristic problem solver. Even 
more disappointing than these tallies of un- 
solved problems was their distribution. It was 
mainly problems in the later part of the suc- 
cession of 60 which were not solved, the dif- 
ficult problems requiring for their solution use 
of solutions obtained for problems occurring 
earlier in the succession. The obvious conclu- 
sion is that “theorem utilization” wasn’t work- 
ing very well. 

At first we thought that perhaps the failure 
was a matter of the succession of problems not 
being long enough, i.e., that the program just 
needed more simple problems from which to 
build up a store of experience before being able 
successfully to attack the difficult ones. We at- 
tempted a minor empirical test of this possi- 
bility by giving the program a second chance 
on the unsolved problems from the first run 
through the succession of 60, a second chance 
during which it could use for any given prob- 
lem theorems acquired after its first attempt 
on the problem in addition to the theorems 


which had been available at the time of the first 
attempt. No more of the difficult problems 
were solved with the additional theorems than 
had been solved in the first place. 

(3.3) Implications of the Failure for Theorem 
Utilization 

In the difficult cases the program was failing 
because it did not possess techniques powerful 
enough to deal with severe ordering constraints. 
For problems like Problem 48 illustrated in 
Figure 5, for instance, even when the program 
had discovered a usable combination of basic 
steps, ordering them in the precise way neces- 
sary to avoid the proscribed cells required some- 
thing the program does not possess, viz., a 
sharp sensitivity to patterns formed by these 
cells. The best the program could do on Prob- 
lem 48 is illustrated in Figure 16. 

Of course, the sharp sensitivity alone would 
be of little value. There would be no point to 
the program’s being able to recognize a pat- 
tern of proscriptions if this didn’t indicate 
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Figure 16. Machine Inability to Achieve the Precise 
Ordering Required to Solve Problem 48. 
Appearances notwithstanding, the program is a long 
way from attaining an ordering of steps which will 
move it through the tight part of the illustrated prob- 
lem field. Most of the program’s failures are attribut- 
able to weaknesses of its ordering procedures — despite 
the fact that these constitute the bulk of the program, 
(The illustrated attempt is in the FKC.) 
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something for it to do. Recognition of a narrow 
channel through which the successful path must 
proceed would have to suggest procedures for 
selecting the particular basic steps and an 
ordering of them which would effect a path of 
acceptable dimensions ; similarly for recognition 
of a required sharp right bend, or of a required 
detour to the left, or of a circumscribing rec- 
tangle within a certain size range and with 
entrance and exit required at certain places, etc. 
So the system would need a much more power- 
ful pattern-recognition capability than it pres- 
ently has ; but it would also need a system con- 
necting this capability with methods of 
behaving appropriately given certain patterns. 

It will be recalled that information about the 
order of basic transformations in a successful 
sequence is discarded by the present program 
when that sequence is abstracted and stored as 
a previously-proved “theorem.” There is no 
point to saving information about order when 
the pattern-recognition system used for re- 
trieval of theorems relevant to the solution of 
particular problems is able to recognize only 
direction and distance aspects of the problems 
and nothing about ordering constraints. How- 
ever, given the appropriate pattern-recognition 
capability there would be reason for saving in- 
formation about order. For our particular prob- 
lem domain (as well as for many others), one of 
the things which makes a problem difficult is its 
ordering constraints. Because our system is un- 
able to recognize clues involving these con- 
straints, and with these clues to retrieve rele- 
vant information from its file of past experi- 
ence, it falls down badly on problems involving 
severe ordering constraints. We need, then, two 
complementary things: improved pattern-rec- 
ognition capability and different kinds of in- 
formation saved about our previously-proved 
theorems. 

It might be contended that we are begging the 
question. Surely there are ways to satisfy 
severe ordering contraints other than the re- 
trieval and use of orderings previously worked 
out. To this charge we reply that there might 
well be such alternative ways but there appear 
to be sharp limits on how far one can go with 
them. This is related to the point we made 
earlier about the apparent necessity of theorem 
utilization for the mechanical synthesis of non- 


trivial, long sequences. One can, of course, 
write a more and more complex program, but 
at some point he runs out of programming time 
or out of program storage space in his com- 
puter. The limits we are talking about are prac- 
tical. Also, a point of special importance for 
artificial intelligence research (see Kelly and 
Self ridge®) , there is a serious question whether 
the more and more complex program would not 
necessarily have to be more and more ad hoe 
with respect to a particular problem domain. 
These possibly rash statements are at least 
partly based on our experience with the present 
program. 

We have tried a long list of techniques in- 
tended to increase the ordering ability of our 
program. Our hope has been to achieve a 
program of sufficient problem-solving capabil- 
ity that we could use it as a vehicle for experi- 
menting with alternate schemes for accumulat- 
ing, culling, abstracting, filing, retrieving, and 
applying theorems; and we did not want the 
ordering aspects of the problems in our prob- 
lem domain to enter into these inyestigations 
concerning theorems because of the very great 
difficulty of programming the pattern-recogni- 
tion mechanisms which would be required. 
Hence, our valiant try to get the program to 
handle the ordering aspects of its problems by 
means other than theorem utilization. The try 
has failed, however, and so we have not so far 
been able to experiment with alternate schemes 
of theorem utilization ; the reason is that 
changes in performance due to changes in these 
schemes are masked by effects due to weak- 
nesses in parts of the program which were sup- 
posed to perform their functions without 
theorem utilization. 

We salvage something from the failure if we 
learn a lesson concerning the importance of 
theorem utilization even for those parts of our 
problems where we thought we could get along 
without it. The reader will be helped in decid- 
ing whether he agrees with us that we can’t get 
along without it, if we indicate some of the 
things we did in trying to do so. We have al- 
ready described probably the most important 
thing, viz., having our program look for gaps 
through which a successful path apparently 
will have to proceed and use these gaps as bases 
for breaking a given problem into subproblems. 
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Several other things were tried, many of them 
rather difficult to implement. Actually the 
reason we conducted as many as four separate 
runs through the succession of 60 problems was 
that with each successive run we hoped (to no 
avail) that we had devised significantly im- 
proved ordering techniques. We tried a tech- 
nique for shortening paths, and thereby reduc- 
ing the likelihood of violating proscriptions, by 
replacing parts of unacceptable paths with 
theorems spanning the same gaps but involving 
fewer basic steps. This caused over-all per- 
formance to deteriorate ; the program did better 
by considering many alternative paths, rather 
than considering only a few and spending a lot 
of time on each of these trying to edit it into 
acceptable form. We tried techniques so easy 
as blindly reversing the order of steps in a path 
and so difficult as determining and effecting 
permutations which would usually move a 
bothersome part of a path from an area con- 
taining proscribed cells into an area contain- 
ing none or at least fewer. We programmed 
and tried a complex process intended to relax 
the ordering constraints on a problem by dis- 
covering pairs, alternative to the given start 
and goal pairs, into which the given pairs could 
easily be transformed but lying in an area 
allowing greater freedom of choice. We pro- 
grammed and tried a complex process which 
packed the steps of a path into an area of mini- 
mum dimensions. None of these things worked 
very well. The simple fact of the matter is we 
have been unable to devise and realize effective 
ordering techniques which make no use of pre- 
vious experience on simpler problems. 

It might be suggested that we have boxed 
ourselves in with the kind of theorem utiliza- 
tion that we do have, that rather than selecting 
theorems which determine the constituent steps 
of a path and then being faced with ordering 
and re-ordering these steps, a problem solver in 
our domain should select basic steps one by one, 
selecting a step only if this can be combined 
with previously selected steps in a manner that 
satisfies ordering constraints. It seems to us 
that the experiment reported above with our 
program in theoremless mode is relevant to an 
answer. In this mode the program has no alter- 
native but to select basic steps one by one, and 
it will be recalled what a difficult time it has 


merely selecting a long sequence of steps which 
will effect a closure let alone a sequence which 
satisfies severe ordering constraints as well. 

4. GENERAL CONCLUSIONS 

(4.1) The Significance of Theorem Utilization 
(Again) 

We want now to indicate the significance 
we find in our experiments for matters beyond 
better ways of programming a computer to 
move a knight around on a chessboard. First, 
let us repeat — ^for emphasis — our main point 
concerning the apparent importance of using 
previously-developed, defined transformations 
for tasks involving synthesis of long, order- 
constrained sequences of certain given basic 
transformations. That the ability to save and 
use previous experience in this way is impor- 
tant for a man is apparent enough to anyone 
who will carefully observe himself proving a 
theorem or writing a computer program. Per- 
haps we have gone some way toward deter- 
mining just how important such an ability 
might be for a problem-solving automaton as 
well. But we have said enough about this, 

(4.2) Importance of Pattern Recognition for 
Problem Solving 

Second, we would remark on the importance 
of pattern-recognition capabilities to effective 
theorem utilization and hence to effective prob- 
lem solving. The conclusion to which we are 
led is that a bottleneck presently preventing 
more powerful problem-solving machines is 
perceptual impotence. Our machines won’t be 
good problem solvers until they are good theo- 
rem utilizers, and they won’t be good theorem 
utilizers until they are good pattern recogniz- 
ers. Hard as this bottleneck might be to break, 
unfortunately it isn’t the only thing inhibiting 
progress. 

(4.3) Program Organization Difficulties 

Third, we would say something about some 
thorny problems of program organization that 
arise in connection with a complex heuristic 
program. We have discovered as have other ex- 
perimental programmers (e.g., cf. NewelP*) 
that a problem-solving program which breaks 
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problems into subproblems and recursively at- 
tacks these, breaks the subproblems into sub- 
subproblems and recursively attacks these, etc., 
will likely be a rather poor problem solver un- 
less there are some rather strict controls im- 
posed on its recursion. Such a program is likely 
to dig itself into a hole and never get up and 
out of the hole to a position from which it can 
recognize that it should have been digging in 
a different location in the first place. Perhaps 
even worse it is not able to relate the different 
subproblems which it defines at different levels 
of recursion to each other to discover overlap, 
duplication, and the possible significance of 
joint solution of subproblems quite separated 
from each other on the subproblem tree. And, 
finally, when it fails at some point and has to 
decide what subproblem to turn to next, obvi- 
ously it would be a better problem solver if it 
were able to base its decision on some kind 
of overview of the subproblem tree rather than 
turning automatically to the next subproblem 
one level up from the point in the tree at which 
it failed, but this is precluded in the case of 
uncontrolled recursion. We can report that all 
these difficulties (and more) arose in our case, 
and we have no general resolution of them to 
offer. However, it is perhaps worth reporting 
that we found very useful, indeed even indis- 
pensable, the technique of building up a tree 
of subproblem records (isomorphic with the 
tree of subproblems solved and unsolved, pend- 
ing and disposed of) which could be consulted 
by the program at any stage or level of recur- 
sion when deciding whether to create a new 
subproblem or whether to go to work on some 
previously created subproblem. Even though 
the techniques used for examining and compar- 
ing the records on this tree were quick and 
dirty, controlling recursion with the tree served 
to prevent much duplication of effort and mis- 
interpretation of the relative importance of 
particular subproblems. 

(4.4) Harder Problems (for Programs and for 
Programmers) 

Finally, something should be said about 
where we see the main difficulties if we are to 
implement an experience-utilizing system of 
the kind here discussed for proving theorems 
in a calculus of greater complexity and sophis- 


tication than the HKC or FKC, say for the 
propositional calculus or the predicate calculus, 
or for programming computers more powerful 
and interesting than vector adders. The first 
problem will be in determining a useful set of 
properties with which to categorize theorems 
or subroutines and a corresponding set with 
which to categorize problems and then, of 
course, developing pattern-recognition proce- 
dures which enable detection of these proper- 
ties. The second, and it will be recalled from 
our discussion above that we consider this a 
problem complementary to the categorization 
problem, will be determining how successful 
solutions should be abstracted, i.e., what kinds 
of information about them ought to be filed, and 
then, of course, developing procedures which 
can perform the desired abstraction. But these 
two problems are overshadowed by one of a 
kind which has not yet been solved, by us or by 
anybody else, even for systems so simple as the 
HKC and the FKC. This is the problem of gen- 
eralizing a number of particular, previously- 
developed solutions into a single solution 
schema. 

We haven’t said anything about this general- 
ization so far because our present program 
doesn’t do it, instead saving and using only par- 
ticular solutions. But skilled human problem 
solvers obviously aren’t so limited in their use 
of previous experience and much of their power 
derives from their ability to use general sche- 
mata which have been built from many sepa- 
rate, particular solutions. We can easily illus- 
trate the kind of general schemata we are 
talking about by looking at the HKC. Figure 
17 shows a human solution of Problem 58 in 
the HKC. (Problem 58 was much too diflicult 
for our program, in any of its several different 
versions, to solve.) Several parts of the suc- 
cessful path might have been constructed by 
using general schemata (and were so con- 
structed, if we are to believe the intro- 
spection of the human who solved the problem.) 
Thus in synthesizing the lower part of the path 
manifesting a regular alternation of the basic 
steps < — 1, — 2> and < — 1,2 >, the problem 
solver could have put to good use a general 
schema enabling left movement through any 
required distance within minimum vertical 
dimensions simply by alternately applying 
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Figrure 17. Human Solution (of Problem 58) 
Requiring Extremely Complex Ordering. 

The human problem solver used general schemata. 
Thus rather than moving from <12, 2> to <4, 2> by 
remembering and applying some combination of steps 
which effects a specific < — 8, 0> transformation, he 
used a general schema for moving left any distance 
required. (The illustrated solution is in the HKC; 
notice the allowed steps.) 

these two basic steps. As it now stands, the 
program saves from its experience much more 
specific information, viz., information about 
how to effect the left-moving transformations 
— 2,0]>, < — 4,0>, < — 3,2)>, < — 5,2>', 

< — 3, — 2>, < — 5, — 2>, etc. But it can not 
generalize this specific information into the 
schema enabling left movement through any de- 
sired distance. A human being apparently has 
no difficulty making such a generalization. In 
fact, even rats don’t have any difficulty in mak- 
ing a generalization as simple as the one illus- 
trated (cf. Krechevsky).!® But we are surely 
up against one of the hard problems of com- 
puter learning. 

For the present we are concentrating on 
methods of representing general schemata 
within a computer and methods of using them 
to solve problems. To take an elementary 
case, in the propositional calculus we are faced 
not with representing and using as independ- 
ent, isolated items the particular theorems 
(P Q) ('^Q ~P), (P ~Q) 


(Q ~P), etc., but with representing and 
using the general transposition schema under 
which are subsumed all four theorems with 
which one can transpose a conditional. The 
problems of machine representation and use are 
hard enough. Maybe their solution will sug- 
gest some ideas about how to solve the more 
interesting problem of getting a machine to 
generalize and establish such schemata by it- 
self, or at least with no more help than a human 
student has. But these problems of machine 
learning of general concepts are so hard as to 
suggest that maybe we ought to relinquish this 
faith that artifacts can be made to think. May- 
be we ought to become Platonists all. 
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ANALYTICAL TECHNIQUE FOR AUTOMATIC DATA 
PROCESSING EQUIPMENT ACQUISITION 

Solomon Rosenthal 
Directorate of Data Automation 
Headqu/xrters, U,S, Air Force 
Washington 25, D. C, 


In this paper we describe in considerable 
detail the ‘‘Analytical Technique for Automatic 
Data Processing Equipment Acquisition'’ de- 
veloped and documented with Mr. Ernest L. 
Holt and Mr. Joseph T. Averitt for the Direc- 
torate of Data Automation, Headquarters, U.S. 
Air Force. This is the first public description of 
this Technique and how to use it. 

The Technique is designed to facilitate com- 
petitive selection of Electronic Data Processilig 
Equipment regardless of the purpose for which 
the equipment is to be acquired. It is equally 
applicable for centralized or pick-your-own con- 
cepts of selection. 

There are four parts, each designed as part 
of an integrated whole. The parts are : 

I Instructions for preparation of specifica- 
tions for submission to vendors. 

II Instructions for submission of EDPS 
proposals. 

III Instructions for validation of EDPS 
proposals. 

IV Instructions for scoring of EDPS pro- 
posals. 

Detailed knowledge of the other three parts is 
not required to implement any one part, but to 
insure effective use of the Technique, those 
charged with the responsibility for its overall 
administration must be thoroughly familiar 
with all four parts. 


This system, including a weighted factor se- 
lection method, was designed to be flexible, im- 
partial, efficient and effective. Flexibility is 
achieved by permitting factor and weight modi- 
fications which best suit each individual selec- 
tion. To maintain impartiality the factors con- 
sidered and the weights assigned are varied 
prior to solicitation of proposals. The develop- 
ment of the technique was not influenced by ap- 
plications to be processed on selected equipment,, 
nor by specific equipment characteristics or con- 
siderations for any vendor. Equipment selec- 
tion time is shortened, the resulting selection is 
valid, and the documentation produced fully 
supports the selection. Due to the built-in 
flexibility, impartiality and efficiency of the 
technique, it is acceptable to the ultimate user 
of the selected equipment, to approval authori- 
ties, and to vendors — in short, it is effective! 

PART I 

The “Specifications Module,” or “Specs for 
Specs” as some like to refer to it, was developed 
to provide a standard arrangement, organiza- 
tion and approach to the problem of preparation 
of the EDPS specifications for submission to 
vendors. A more complete description would 
provide a table of contents for specifications, 
and detailed instructions for the contents of 
each chapter and paragraph. Here,, rather than 
present the prescribed format, we will discuss 
the material to be included in the specifications. 
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Specifications prepared in accordance with the 
standard format provide sufficient information 
about the job to be done to permit interested 
vendors to submit proposals that are truly re- 
sponsive, and also to establish for the scorer a 
firm basis for application of the scoring module 
to be described later. 

Prospective bidders must be fully informed 
about intended locations and the user’s organi- 
zational mission and objectives. The user’s 
plans for conversion from any existing system ; 
for centralized or decentralized programming 
in the case of multiple installations ; his equip- 
ment delivery and implementation schedule; 
and his intention to pilot test before ordering 
more than one system, are all bits of informa- 
tion a vendor needs to know before investing in 
a proposal. 

Any conditions or minimum standards, with 
which the vendors must comply, must be docu- 
mented. These specific requirements, or restric- 
tive criteria, should be held to a minimum, and 
must not be designed to eliminate any vendors 
from consideration. If these rules are followed, 
it seems reasonable to state that non-compliance 
with any of these specifics could result in a pro- 
posal being eliminated from competition. Re- 
strictions might justifiably apply in such areas 
as: 

Delivery (Dates and Places) 

Cost (Maximum Funds Available) 

Timeliness (Allowable delay between remote 
station inquiry and response, etc.) 

Support (Programming, maintenance, sys- 
tems analysis, etc.) 

Physical Environment (Room size, controls, 
floor) 

Equipment Characteristics (Print positions, 
character sets, random access storage, etc.) 

Compatibility (With data banks, programs, 
other interface requirements) 

Expansion Capability (Anticipated work- 
loads) 

Software (Monitors, executives, scientific 
subroutines, common languages, etc.) 

Others 

We must remember that to qualify for inclu- 
sion in the above list an item must be a firm, 
unalterable requirement. 


The heart of a specifications package is that 
part which documents the job to be done. For 
each data processing task, a statement is re- 
quired which fully describes a representative 
sample of the total workload of that type. As 
large a percentage of the total as is practicable 
must be stated for each task. A sufficient num- 
ber of task statements must be made to in- 
dicate collectively the requirement for all com- 
puter characteristics involved, i.e., an adequate 
number to enable determination of the configu- 
ration and capacities needed to accomplish the 
tasks. 

A complete statement includes a general de- 
scription of the job and the functional areas 
involved (personnel, materiel, accounting, sci- 
entific research, etc.). All processing require- 
ments must be described in narrative form. All 
items, documents, files, master files, tables, etc., 
which are to be used during — or produced by 
each processing step, should be discussed. Sort 
runs, edits, and so forth should be explained. 
Flow charts, consecutively numbered, describ- 
ing the data system must be prepared. Each 
processing step must be identified and numbered 
to correspond to numbered paragraphs in the 
narrative. All inputs to, and outputs from, 
every chart must be clearly labeled to show their 
source or destination and to tie all the charts 
together. All inputs and outputs must be de- 
scribed in terms of the number of Alpha and 
Numeric characters contained. If sample docu- 
ment forms are available they should be in- 
cluded. 

A key item in our system is the “Workload 
Statement Chart’’ which must be supplied with 
the specifications; it provides a tabular sum- 
mary of the information contained in the job 
statement just covered. A separate page or 
chart should be produced for each application or 
task. A more complete description would leave 
nothing to the imagination in the instructions 
for completing this chart. Here, we merely pre- 
sent a recommended “Workload Statement 
Chart” (Figure 1). Just one word of caution; 
the entries must serve to explain what came be- 
fore, not to confuse the reader. In other words 
it should be simple for anyone to relate each 
entry to a corresponding part of the narrative 
and applicable flow chart. 
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Fignire 1. Workload Statement Chart. 


The final entry on the ‘‘Workload Statement 
Chart'' for each task should be “The above state- 
ment represents ‘X'% of the total workload of 
this type." The final entry on the last chart 
should be “The above statement (s) repre- 
sent (s) ‘Z'% of the known total workload." 
The percentages indicated will be used later in 
proposal preparation to project an estimate of 
the total time each proposed equipment will re- 
quire to accomplish the total requirements. 

In most cases it will be advantageous to give 
prospective bidders a description of the intended 
site(s). Include such information as available 
floor space, loading factors, maximum distance 
allowable beteen main computer and remotes, 
etc. To enable the vendors to propose a com- 
plete system, including power converters, regu- 
lators and other items needed, tell them what 
kind of facilities exist. You might also wish to 
tell them your current inventory of such things 
as magnetic tapes and control panels to give 
them a basis for quoting some trade-in values. 

PART II 

The “Proposal Module” serves a dual purpose. 
First, it provides those requesting a proposal 
with a complete guide to designing that request. 
Second, it provides the vendors with a “recom- 
mended" standard arrangement and indexing of 
the many subjects to be covered in a proposal. 
Therefore this module should have two prefaces 
— one to be read by those preparing the request, 
the other to accompany the request, to be read 
by those responding with proposals. 


These prefaces should include. 

In the First : 

This part is used in preparation of instruct 
tions to vendors. These instructions must be 
tailored to the requirements of the particular 
selection by those forwarding specifications 
and requests for proposals to vendors. 

In the Second: 

We request that your proposal be prepared as 
outlined in the attached “instructions for pro- 
posal organization." 

This format is not intended to restrict the 
contents of your proposal. The entire con- 
tents of Chapters I thru IX must relate to the 
specific configuration of equipment on which 
your proposal is based. 

Your proposal should include only that one 
equipment configuration adjudged by you to 
be the best for the purpose of this selection. 
If you consider it necessary to propose more 
than one configuration, each must be sub- 
mitted as a separate and complete proposal 
(Chapters I thru IX). 

Following Chapter IX, you may furnish any 
additional chapters which you consider perti- 
nent to your proposal. 

When proposals are prepared in a standard 
format a giant step has already been taken to- 
wards simplifying the comparison of proposals. 
It will be possible to pull from each proposal a 
chapter or paragraph, a prescribed chart or 
table, and compare the proverbial apples with 
apples, not with oranges or mangoes. 

The standard format in which proposals 
should be submitted, and the kind of informa- 
tion to be included, is : 

Chapter I General 

A letter of transmittal and a brief summary, 
for executive review, of the highlights of the 
proposal. 

Chapter II Responsiveness 

A statement of the ability to satisfy the re- 
strictive conditions or minimum standards 
given in the specifications (see Part I — The 
“Specifications Module"). 

Chapter III Equipment configuration and costs 
A table (Figure 2) showing all components of 
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Figure 2. Equipment Configuration and Costs. 

the proposed system with complete pricing 
information. 

Chapter IV Systems and Timing 

1. An explanation, with flow charts, of the 
vendor’s approach to the solution of the prob- 
lem, when the approach constitutes a depar- 
ture from that given in the specifications. (In 
some cases, vendors are asked to design or 
redesign a data system.) 

2. Tables (Figures 3 and 4), called “Timing 
Estimate — Detail” and “Timing Estimate — 
Summary.” These tabulate all time required 
by the main processor and associated periph- 
eral devices. When these tables are com- 
pleted properly, it is easy to compare them 
with the “Workload Statement Chart” (Fig- 
ure 1) and to determine whether or not each 
step has been timed. There must be line en- 
tries on the “Timing Estimate Charts” for 
corresponding entries on the “Workload 
Statement Charts.” 



Figure 3. Timing Estimate Stated Workload. 
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Figure 4. Timing Estimate. 

Chapter V Questionnaire 
A questionnaire tailored by the requestor to 
the requirements of the submitted specifica- 
tions and characteristics of each component 
structions make it clear that every question 
must be answered, and that each question 
will be answered with a number, a percentage, 
a price or other such specifics. 

Chapter VI Equipment Factors 
Technical literature giving detailed descrip- 
tions and characteristics of each component 
of the proposed configuration. Requirements 
for site preparation should be made known 
here. 

Chapter VII Programming and Software 
A list and explanation of the software pack- 
ages available for use on the proposed equip- 
ment, with emphasis on those particularly 
useful in the work to be performed. 

Chapter VIII Support 

1. A description of the training available 
which meets the requirements of the pro- 
spective user. 

2. A general discussion of the proposed main- 
tenance plan, including a specific quotation of 
required preventive maintenance. 

3. A statement about back-up equipment 
available for use in emergencies. 

Chapter IX Contract 

A copy of the contract (GSA schedule in the 
case of Federal Government), and any con- 
tractual deviations proposed for negotiation 
which would apply to the proposal submitted. 

When both specifications and proposals are 
prepared in a standard way, vendors know what 
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their equipment, if selected, would be expected 
to do, they can prepare their proposals most 
efficiently and economically, and a sound basis 
for objective comparison and selection is estab- 
lished. 

PART III 

The ''Validation Module’' also serves two pur- 
poses. It provides the technicians with stand- 
ard sidelines for determining the proposal’s 
compliance with the specifications and its ac- 
curacy and adequacy. The validators are not 
charged with determining which of the pro- 
posed equipments is best but with determining 
whether or not the proposals being validated 
are acceptable. They do not compare one pro- 
posal with another. 

First the specifications package and the in- 
structions (tailored for the particular acquisi- 
tion) to the vendors are read. Next the pro- 
posals are examined. After the validator is 
familiar with the specs and the proposal which 
he is to validate, he is ready to check all state- 
ments, answers to the questionnaire, tables and 
charts in a vendor’s proposal. Claims about 
storage capacities, simultaneity, time estimates, 
and special purpose devices are some areas re- 
quiring detailed consideration. 

The validator must have at his disposal a 
comprehensive library of other publications 
from the vendor whose proposal is being 
checked, recognized technical publications from 
other sources and a vast storehouse of personal 
knowledge and experience in both the equip- 
ment and subject matter fields. 

During this process the validator may deter- 
mine that a proposal does not comply with one 
or more of the requirements, is not adequate to 
accomplish the tasks described in the specifica- 
tions, or that some inaccuracies exist in the pro- 
posal. When any such determination is made, 
the validator is required to prepare a statement 
containing the specific area or portion of the 
proposal considered unresponsive, inadequate 
or otherwise in question. This statement, with 
recommendations for correction or elimination 
from further consideration, goes to a reviewing 
authority for decision. If the reviewing au- 
thority concurs with the validator’s statements 
concerning non-compliance or inaccuracies, the 
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Figure 5. ED PE Evaluation Sheets. 

submitting vendor may be called ii^for consul- 
tation and mutually agreeable changes. 

This module also contains the first pass of 
the "EDPE Evaluation Sheets” (Figure 5). A 
complete set of evaluation shetts is processed 
for each proposal submitted. These sheets will 
already have beei\ modified to reflect the specific 
list of factors to be considered for the acquisi- 
tion at hand. The number and types of factors 
are, for all practical purposes, infinitely varia- 
ble. There are only two restrictions on the 
types: (1) Each must represent an item having 
a bearing on the applicable merits of a proposed 
configuration; and (2) it must be numerically 
rateable. There is no allowance in this system 
for evaluating the color of a sales engineer’s tie, 
or the "reputation” of his employer, or "The 
one I got was a lemon — ^they’re all no good” line. 
The tailoring could have been done as early as 
the time the specifications were sent to the 
vendors, but may be done at any time prior to 
receipt of proposals. 

The validators, who do no actual scoring, post 
all entries in the factors section of these sheets. 
Every blank must be filled in with a number, a 
check ( V ) for yes or no, an N/A or other appro- 
priate specifics as instructed. The basic sources 
for all data entered, except that resulting from 
computation, are the proposals and available 
technical publications. 


Minor i Intar 
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At the end of this phase the evaluation sheets 
contain data which is accepted as accurate and 
complete. These sheets then become the inputs 
to be compared and scored. 

The validators must also prepare a written 
summary of each proposal. These reports must 
include actual costs and processing times, and 
differences between the proposal and the specs 
with respect to systems approach. 

We realize, that for any given selection, those 
determining which factors to consider may over- 
look some items. A validator might feel that the 
list of factors completely ignores some valuable 
features of a particular proposal. In the report 
he may discuss any pertinent factors found in 
that proposal which are not reflected in the 
evaluation sheets. The report may also be used 
as a medium for a “sales pitch” for or against 
a proposal. This report, with the scores, will be 
considered in the final analysis by those making 
the selection. 

PART IV 

The “Scoring Module” is a technique for scor- 
ing validated proposals. Factors, already dis- 
cussed in Part III, relative weights, and detailed 
instructions for the scoring, all designed for 
each specific selection, must be completely docu- 
mented before proposals are in. While each se- 
lection to be made may be based on a different 
set of factors and different weights, the method 
of applying these weights does not vary. 

The inputs to the scoring scheme are the 
“EDPE Evaluation Sheets” (the factor section 
completed) and the weights and instructions 
for their application. The output is an “EDPE 
Evaluation Summary” (Figure 6). 

The scorers accept the validators’ entries on 
the input side as gospel and are only concerned 
with comparing and scoring, not with valida- 
tion. The scorers are required to complete the 
comparison base section of the evaluation sheets 
and to compute the minor, intermediate, major 
and total scores as dictated by the weights and 
formulae supplied. This approach permits a 
rapid, unbiased selection, and the scorer’s data 
added to that of the validators provides the 
much needed, formalized backup documentation 
sure to be required by the final- authorities. 
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Figure 6. EDPE Evaluation Summary. 

Let’s look at a hypothetical set of factors. 
There might be any number of major categories, 
including overall cost, equipment characteris- 
tics, system potential, vendor’s support and 
others as required. Within the cost category 
we might look into the actual cost of the equip- 
ment on both a lease and a purchase basis, the 
costs of operating, and installing. The charac- 
teristics checked might cover: capacities for 
program and data storage ; compatibilities 
within the equipments proposed and with data 
banks; reliability; and total time required to 
process the whole job spelled out in the work- 
load statements. System potential could cover 
the investigation of the capability to expand the 
workload with or without additional equipment. 
Important aspects of vendor’s support might 
be in the training, maintenance and software 
areas. 

This is a good time to explain what appear 
to be contradictory statements. In discussing 
the validation phase we said we do not rate the 
vendor’s reputation. In this phase we say we 
might rate reliability. Reliability in this sense 
means the designed methods of insuring no 
errors or provisions for error detection and 
correction. There could be three types of re- 
liability insurers — validity checks (does each 
bit structure represent a “legal” character in 
that system?), parity checks (does each char- 
acter contain the proper number of bits?), and 
accuracy checks (does each character moved 
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compare with each character to be moved?) . A 
possible accuracy check which no one yet em- 
ploys is optical scanning of a printed line on 
high speed printer output and a comparison 
with the data that was to be printed and is still 
stored in an output buffer. 

We will next develop the ‘'Comparison Base’’ 
entries. For each factor to be compared we 
select the “best” vendor’s figure. In the case of 
costs the lowest is obviously best as is the high- 
est figure in speed areas. The best is then en- 
tered on the appropriate line of the evaluation 
sheets. It is now possible to compare each 
vendor’s figure with the best figure proposed 
for each item. 

Before we go into the actual scoring opera- 
tion we will discuss the weighting pattern. We 
will not presume here to even suggest what the 
weights should be for any factors. The user’s 
requirements alone are the governing element 
in establishing relative weights as they were in 
developing the factors. 

The maximum score possible for any proposal 
is 100 points. This score, by the way, can only 
be achieved if one proposal is “best” in every 
factor — bar none! We take a deep look at the 
major categories of factors and assign a part of 
the 100 to each based on their relative impor- 
tance. We then determine what part of the 


maximum score possible for each major cate- 
gory is proper for each intermediate within that 
major. We then equate the intermediate maxi- 
mum possible to 100 and assign a portion of this 
to each minor within each intermediate. 

Another way of putting it is : the sum of the 
maximum points possible for all the major cate- 
gories equals 100. The sum of the maximum 
points possible for all the intermediates within 
a major equals that part of 100 assigned to 
that major. The sum of the maximum points 
possible for all the minors within an intermedi- 
ate equals 100% of the maximum possible for 
that intermediate. 

Now that we understand the weighting pat- 
tern we can look at the scoring itself. All of the 
minor scores for a proposal are developed, then 
the intermediates, the majors and the total, in 
that order. Minor scores are developed by either 
of two methods. Some minor factors are not 
compared, one proposal with another, but war- 
rant a predetermined score if certain conditions 
exist. For instance, if free maintenance is pro- 
vided 24 hours per day, “X” points may be 
awarded, if only for 8 hours per day, a lesser 
score is assigned, and so forth. 

The second method applies to the majority of 
the minors. If the SMALLEST is BEST, the 
score is computed with formula A : 


(Comparison Base) (Maximum Possible) 
(“This Vendor” Entry) 


= Minor Score 


If the LARGEST is BEST we use formula B : 


(“This Vendor” Entry) (Maximum Possible) 
Comparison Base 


— Minor Score 


The next step is to total all minor scores lating the intermediate scores. These are com- 

within an intermediate in preparation for calcu- puted in this way : 


(Minor Total for “This Vendor”) (Maximum Possible) 

100 


= Intermediate Score 


The intermediate scores within each major 
category are totaled to produce the major scores 
and the final scores simply require the summing 
of all the major scores. The scoring hierarchy, 
then, looks like Figure 7. 


As shown, the final score is derived from the 
major scores, the majors from the intermedi- 
ates and the intermediates from the minors. 

The final task of the scorers is to transcribe 
the intermediate, major and final scores from 
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each evaluation sheet set to the EDPE Evalua- 
tion Summary (Figure 6). The completed 
summary and the validators’ reports, discussed 
in Part III, become the basis for an intelligent, 
educated decision by a selection group. 

Several built-in safeguards further guard 
against the introduction of personal prejudices 
and political interests in the entire package 
from the writing of specifications through the 
recommendations by a selection group to the 
final authority. These tricks of the trade may 
not be obvious in what we have said so far so 
let us point out a few: 

1. Those who write the specifications (the 
ultimate user of the selected equipment) 
have nothing to do with scoring the pro- 
posals. They are consulted and their sug- 
gestions may be incorporated in establish- 
ing the factors and relative weights. 

2. Those who validate proposals never know 
the weights assigned to any factors. 

3. Those who score do not know which ven- 
dor’s proposal they are processing. They 
really don’t process a proposal at all. They 
score a coded set of evaluation sheets. 



Figure 7. Scoring. 


4. Those establishing factors and weights 
can not be influenced by proposals re- 
ceived since their work must be done be- 
fore proposals are due. 

5. Those responsible for the final decisions 
have a sound basis for their decisions and 
might find it difficult to pick anything but 
the best overall system proposed. 

CONCLUSION 

The Analytical Technique, as described, is an 
integrated system which addresses itself to all 
of the important steps leading to an acceptable 
selection. Other methods of evaluating indi- 
vidual hardware systems have been designed. 
These other methods could be used as most valu- 
able aids during the validation of proposals. 
Such assistance can serve to make the valida- 
tor’s work less tedious and more accurate. At 
least one system we know mechanizes a good 
portion of this area and might be especially 
useful in checking the timing estimates pro- 
posed. 

Other weighted factor schemes for selection 
are in use today. Unfortunately, these are not 
necessarily based on actual requirements, nor 
are scores necessarily computed the same way 
for all selections. 

Standardized data systems specifications and 
vendors’ proposals, comprehensive validation, 
and standard computational procedures, are 
treated in detail and as a single package in our 
technique. 

It is possible to mechanize the scoring portion 
but the short time required to score manually 
does not seem to warrant the programming and 
debugging effort. 

This technique has been used a number of 
times by the Air Force, and it has provided 
valid selections based on requirements. 
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INTRODUCTION 

The Cost-Value Technique of computer evalu- 
ation evolves from the computer evaluation 
and selection techniques used during the last 
twenty years. The Cost- Value Technique has 
benefited by the frustrations, errors, and weak- 
nesses of countless selections; however, each 
selection has added some knowledge to the 
growing problems of computer acquisition. 
Each forward step was accompanied by a tech- 
nique that was more realistic and meaningful. 
I consider the Cost-Value Technique a natural 
consequence of evolution, the successor to the 
Weighted Factors Selection Method, or any 
other known existing evaluation technique. 

The Cost-Value Technique is an evaluation 
technique which attempts to make the function 
of computer selection more meaningful and 
understandable by eliminating some of the 
major weaknesses found in most of the evalua- 
tion techniques which are in use today. This 
is not to say that the Cost- Value Technique of 
computer selection eliminates all the problems 
found in any other type of objective ev^uation 
and selection technique. All it tries to do is 


make the difficulties a little easier to overcome, 
the values assigned a little more realistic, and 
the resulting selection a little more meaningful 
and understandable. It does this through the 
incorporation of two important principles : 

1. First, in an effort to keep the selection 
simple and straightforward, the Cost- 
Value Technique recognizes only two 
categories of factors: 

a. Costs; which are a function of the 
equipment costs multiplied by the 
time required to complete the appli- 
cation, and many other cost items, 
and 

b. Extras; which are any items of 
value that are inherent in the costs 
of one system proposed, but not to 
all systems proposed, and do not 
directly influence the system's run- 
ning time. Thus, extra maintenance 
service or extra expansion capa- 
bilities which are procured as part 
of the basic system are extras, but 
equipment characteristics like speed 
of central processor and peripheral 
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equipment, or memory size, are not 
extras since they directly influence 
the running time which in turn di- 
rectly influences the cost of the sys- 
tem. 

Thus, an item is only evaluated once, 
either by its influence on cost — directly 
or via the running time of the system — 
which in turn is a cost factor, or by its 
value as an extra. 

2. Second, by using dollar cost or value as 
the basis for scoring all the ‘extras' 
oifered, a common denominator is ob- 
tained by which values can be understood, 
discu^ed, and changed independent of 
each other. Also, the ‘extras' scores can 
be directly related to the ‘costs' scores. 

The significance of these two principles might 
be better understood if before going into a dis- 
cussion of the Cost-Value Technique, we took a 
minute out to consider objective evaluation 
techniques in general, and the Weighted Fac- 
tors Selection Method in particular. 

Major Premises and Purpose 

Objective evaluation techniques for computer 
selection are based on the following three 
premises : 

1. The ‘costs' considered in the evaluation 
process should be those costs which are 
associated with securing and maintaining 
the computer system equipment and the 
support necessary to satisfy the require- 
ments of the specified applications. 

2. The major ‘value' sought in any com- 
puter selection technique is the ability 
of the selected computer to satisfy the re- 
quirements of the applications specified. 

3. Some important ‘extras' of value such as : 
excess expansion capability, back-up 
available across the street, proven 99.5% 
reliability, etc., will also be included by 
some vendors in the costs of the system's 
equipment and support proposed to sat- 
isfy the specified application require- 
ments. These ‘extras' will be offered in 
varying amounts by the various partici- 
pating vendors. These ‘extras' are im- 
portairt to the selection and, therefore, 
their ‘value' should be evaluated. 


The purpose of any objective evaluation tech- 
nique might be compared to a series of weight 
measurements on a set of balance scales. For 
each ‘weighting' (evaluation) the ‘costs' neces- 
sary to obtain the computer system equipment 
and support necessary to satisfy the require- 
ments of the specified application, as proposed 
by each vendor, are weighted against the ‘value' 
of having the application completed and the 
value of the extras offered by that vendor. The 
objective of the comparative evaluations is to 
select the ‘weighting' (evaluation) that shows 
the scale tipped most in favor of the ‘value' 
side. 

These are the same three premises and pur- 
poses that are used in the Cost- Value and the 
Weighted Factors Selection Method or any 
other objective evaluation technique. The dif- 
ference between techniques is in the ways in 
which these extras are evaluated. 

Weighted Factors Selection Method — Pro and 
Con 

The Weighted Factors Selection Method is 
one of the better evaluation techniques in use 
today. It recognizes the need for evaluating 
the ‘extras' as well as the standard cost items 
and does so by assigning numerical values or 
point scores to all items, ‘extras’ as well as 
systems costs. In my opinion, however, there 
are still two major weaknesses in the Weighted 
Factors Selection Technique: 

1. The technique is likely to give an absolute 
weight or score to too many factors and 
then to score the details within each 
factor in too many different ways, i.e., 
points for speed of central processor, 
points for speed of input/output devices, 
points for time required to complete the 
full job, points for slack time remaining 
for expansion, etc. Thus, the same item 
(i.e. speed) may be awarded points in a 
number of different ways and for many 
seemingly different reasons. Soon it be- 
comes very difficult to determine the true 
worth or influence of that one item on 
the final selection. 

2. The assignment of point scores to specific 
elements of an evaluation would probably 
accurately reflect the experiences and 
knowledge of the evaluator who selected 
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the elements in either ascending or de- 
scending order of importance and then 
associated each with a fixed numerical 
value. Arbitrary, perhaps, but what else 
may the evaluator rely on save what he 
has learned from experience to be im- 
portant or, conversely, not important? 
And, moreover, how can he avoid giving 
that specific measure of value to elements 
of the evaluation that reflect in his per- 
sonal, subjective opinion, their true 
worth? And, how does he assign scores 
and identify important elements ? An 
example of such a point assignment is 
shown in Figure 1. So subjective is this 
system that “X” number of evaluators, 
if requested to assign point scores to a 
list of elements considered important to 
a computer selection, would produce “X” 
number of lists; each selective, each the 
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5 



Cards 
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10 


Magnetic Tape 
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Printers 

3 



Other Peripherals 

2 



Reliability 


oO 


Special Features 


10 


Problem Timings 


15 


Central Processor Limited 

2 



Input /Out put Limited 

8 



Balanced 

"3 



Other Characteristics 


5 


EXPANSION POTENTIAL 



200 

Slack Time 


150 


Central Processor 

50 



Peripheral Equipment 

100 



Maximum Expansion 


50 


Memory 

25 



Peripheral Equipment 
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SYSTEM SUPPORT 
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Program Assistance 


10 


Develoinnent 

5 



Writing 

3 



Converting 

2 



T raining 


10 


Maintenance Offered 


10 


Program Testing 


20 


Existing Software 


50 


Sort/Merge 

10 



COBOL 

10 



FORTRAN 

10 


1 

Report Generator 

10 



Other 

10 




best from the evaluator’s point of view, 
and each different. 

Minor differences would be understandable, 
but the differences are not likely to be minor. 
The magnitude of the differences can be 
glimpsed in the startling contrasts of two dis- 
tinct groups which are presently working their 
way to management positions. One group has 
a background in Finance and the other in Engi- 
neering. If these two groups were asked, in- 
dependently, to use the Weighted Scoring Tech- 
nique to evaluate a computer selection, I suspect 
the results would be very interesting. Joined 
as they may be in a common cause, identifiable 
principally, perhaps, with corporate well-being, 
they still would regard the method in much 
different ways and in all likelihood the major 
weights assigned by these two groups would 
differ considerably. Shown in Figure 2 is an 
example of the way these two groups might 
assign weights to the four major categories 
listed in Appendix I. 

How do you reconcile such differ- 
ences of opinion? Moreover, does any 
reconciliation or compromise improve 

QT* anViQTiPO Q/»r»n'nfQ'hili+xr rvF I'/ianlf- 

VJ' VJL & WO VIA «/— 

ant ‘weighted scores’ ? 

The difficulty — ^weakness — if you prefer, in 
the Weighted Factors Selection Method is that 
points assigned to the cost section cannot be 


SAMPLE WEIGHTINGS OF MAJOR CATEGORIES 


Categories 

Group With 

Financial Background 

Group With 
Eng. Background 

Cost 

85% 

25% 

Equipment Characteristics 

5% 

25% 

Expansion Potential 

5% 

25% 

System Support 

5% 

25% 


Figure 1. 


Figure 2. 
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reconciled to the points assigned to the equip- 
ment characteristics section or to any other 
section, or even from item to item within a 
section. In other words, as the four categories 
presently exist in the Weighted Factors Selec- 
tion Method, there is no common denominator 
between categories, thus there is no meaningful 
way of relating points from one category to 
another, or for that matter, between the ele- 
ments within a category. 

COST-VALUE TECHNIQUE— GENERAL 

The Cost-Value Technique, like the Weighted 
Factors Selection Method, also recognizes the 
necessity of evaluating the ‘extras’ offered in 
various computer systems. The Cost-Value 
Technique, however, is unique in its handling 
of these ‘extras.’ The Cost-Value Technique 
studies any extras offered in the proposals to 
determine whether the claimed ‘extras’ are 
truly important extras or mere incidental ele- 
ments that appear to be extras. For instance, 
a sixty-nanosecond memory and a ten-thousand- 
cards-a-minute reader in themselves are not 
important extras, if extras at all ; the important 
extra is how much slack time exists in the pro- 
posed system because of these high-speed units. 
For every ‘extra’ considered important, a study 
must be initiated to determine the value for the 
extra. The really distinguishing feature of the 
Cost-Value Technique is in the assignment of 
the value associated with these important 
extras. The value assigned is in terms of costs 
(dollars). 

A brief explanation may be helpful at this 
point because the significance of the value as- 
signment in cost may not be readily apparent. 
By ‘simply’ assigning a cost-value to the various 
important ‘extras’ offered by the various ven- 
dors, a technique has been found to overcome 
the seemingly insurmountable problem (no ob- 
jective method of relating points assigned the 
various items evaluated) of the Weighted Fac- 
tors Selection Method. A common denominator 
has been established with which all , offered 
extras may now be related. Although the cost- 
values assigned to the various extras will still 
be a matter of each individual selection and will 
continue to refiect the opinions of the cost-value 
assigners, a value when assigned can be under- 
stood, examined, discussed, and changed in- 


dependently of all other individually assigned 
values. Another very important benefit de- 
rived by the use of cost assignment of value in 
the Cost-Value Technique is that management, 
for perhaps the first time in the twenty-year 
history of computers, can understand what is 
going on in an evaluation and is able to make 
informed decisions on the value of any disputed 
extras. 

The cost-values established for the various 
extras found within a proposal are assigned 
as ‘credits’ to that proposal (weight added to 
the value side) . This allows each proposal sub- 
mitted to be evaluated by taking its total ‘out- 
of-pocket’ cost and subtracting the cost-value 
‘credits’ awarded that proposal to find the dif- 
ference. The proposal having the smallest dif- 
ference (since the ‘value’ of having the appli- 
cation completed remains constant for all ven- 
dors) automatically becomes the proposal se- 
lected by the evaluation technique as being 
most advantageous in terms of the amount of 
systems equipment and support that is being 
obtained for the money spent. 

An example of how the Cost-Value Technique 
might be applied is shown in Figure 3. 
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Having now considered the three major 
premises, the purpose of an objective evalua- 
tion technique, and the distinguishing feature 
(value assigned in cost terms as credits and 
fewer factor categories used) of the Cost-Value 
Technique, let us proceed to a detailed examina- 
tion of the previously mentioned cost-value 
study of all extras offered. 

COST-VALUE STUDY— WITH EXAMPLES 

The Cost-Value Technique’s approach to the 
‘extras’ offered by the vendors is to appraise 
the offered extras to determine whether they 
are worthy of inclusion in the evaluation, and 
if so, to determine the cost-value of these 
extras. To avoid any bias, or appearance of 
bias, on the part of the evaluators, this study 
should preferably be initiated before the pro- 
posals are received. It thus becomes necessary 
to deal with hypothetical or realistically antici- 
pated extras. A sample listing of items some- 
times considered as ‘extras’ will be helpful to 
our study and for that purpose a sample listing, 
similar to one that might be found in the 
Weighted Factors Selection Method of evalua- 
tion as shown in Appendix I will be used. The 
sample will be used as a beginning for our 
study of the essentials and extras offered in 
computer system proposals. 

The list shown in Appendix I is arranged 
into four categories : Costs, Equipment Charac- 
teristics, Expandability Potential, and System 
Support. Other groupings of items could also 
be used. Some of the items, like those under 
costs, are not extras, but are included because 
they must be evaluated in selecting a computer 
system. In the following sections, each of the 
listed categories will be studied ; one of the re- 
sults of our study will be the identification of 
these items and categories that are considered 
important in the Cost-Value Technique of com- 
puter selection. 

Before examining each of the four categories, 
shown in Appendix I, these words of caution 
and care are furnished. 

1. Methods described herein for cost-value 
determinations are by no means the only 
methods that could be used. 

2. There is nothing sacred in any of the cost- 
values established in this report since the 


value of any item depends upon the likeli- 
hood of the user’s need for that item. For 
example : 

a. If the described system is to be used 
only for one or two applications, 
and the size and volume of these 
applications are fixed, then the 
cost-value of Expansion Potential is 
likely to be nil. On the other hand, 
if the described system is the first 
system to be installed in a growing 
company, the cost-value of Expan- 
sion Potential will be very high be- 
cause every hour of available ex- 
pansion might be regarded just as 
valuable as each hour in actual use. 

b. If the described system is to replace 
an existing, compatible system, the 
cost-value of some of the Sys- 
tem Support items like ‘personnel 
loaned’ or ‘program assistance’ may 
have no cost-value. If, however, the 
computer is for a relatively inex- 
perienced group, ‘personnel loaned’ 
or ‘program assistance’ might each 
have a cost-value as high as, or 
higher than $20,000 a man year. 

Another general word of explanation before 
getting into the details of cost-value assign- 
ment. It will be of help to discuss the idea of 
cost-valuing the extra. Thus, if four vendors, 
A, B, C and D, respectively, offer 100 hours, 
125 hours, 50 hours and 70 hours of program 
checkout time, then the amount of ‘extra’ to 
which a cost-value should be affixed is 50 hours 
for A, 75 hours for B, and 20 hours for D. The 
extra for each item considered for each vendor 
is the amount offered by that vendor for that 
item, minus the minimum amount of that item 
offered by any vendor. If the extras offered 
are in different terms: Vendor A offers 100 
hours of prime shift time, prior to delivery for 
checkout; Vendor B offers 125 hours, any shift, 
after delivery, for checkout; Vendor C offers 
50 hours of third shift time, prior to or after 
delivery, for checkout — ^then all the items have 
to be converted to their cost-value first, and 
then their difference taken, to determine their 
‘extra’ cost-value. 

The four categories found in Appendix I 
and to be considered now are Costs, Equipment 
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Characteristics, Expandability Potential, and 
System Support. 

Cost Items 

All cost items should be considered in the 
evaluation. Items such as the cost of supplies 
or personnel costs may prove to be non- 
differentiating (no significant difference found 
between vendors) in a given selection, but they 
should not be deleted from the evaluation list 
until their costs have been found to be truly 
non-differentiating. 

Treating cost items as one-time costs or con- 
tinuing costs is a matter of cataloging. Two 
rules must govern any proper treatment of cost 
items : The costs must be spread proportionately 
over the expected life of the system, and the 
system costs must change to reflect the costs 
of any planned system expansion. If, for ex- 
ample, the life of a system is set at six years 
and if a uniform expansion rate of 10% a year 
is expected over the life of the system, then 
each of the cost items on the list should be 
charged (if applicable) to the yearly system 
cost for six years; and it would be expected 
that the equipment cost for the sixth year would 
be larger than the equipment cost of the second 
year. 

Another important consideration relating to 
cost items is that they should show the cost for 
individual pieces of equipment to be used. This 
should be done in all cases except when the 
system is to be used for less than one shift or 
when the entire system is to be purchased. The 
break-out shows which equipments are actually 
to be used for more than one shift (higher 
rental). The breakout also indicates which 
items might be more favorably leased than pur- 
chased under a split acquisition. 

No cost items should be duplicative; that is, 
the system should not be charged twice for the 
same equipment or service. For example, if a 
card reader is used both on-line and off-line, 
the full cost of the card reader should not be 
shown twice. Similarly, program development, 
if performed by users — ^not contractor person- 
nel — and personnel cost should not both be 
costed. 

Basically, the thought behind the cost items 
can be summed up by saying, “Any differen- 


tiating costs that exist between systems, should 
be recorded for evaluation.” 

The second major category of items given 
points in the Weighted Factors Selection 
Method was Equipment Characteristics. 

Equipment Characteristics 

The Cost-Value Technique does not consider 
any equipment characteristics, in themselves, 
to be important extras. Instead, their signifi- 
cance is measured in terms of the running time 
of the system which in turn determines the 
system’s cost and expansion potential. 

TjT)ical of the kind of equipment character- 
istics now being discussed are: the relative 
speeds and capacities of the systems ; hardware 
compatibility; switchability; reliability; and 
some other special features. These obviously 
are features (extras) that determine the time 
required to complete the applications specified 
and in turn determine how much slack time is 
available for expansion, and as such, will be 
evaluated in the next section on Expansion 
Potential. 

Sample problem timing items (time required 
to perform some specific set of sample prob- 
lems) should not be evaluated. They should 
be used exclusively for the validation of the 
application timings quoted in the proposals. 

Other characteristics (size, weight, etc.) 
either are included in the space requirements, 
which are costed; or will appear as special 
items under a new category, ‘Other Extras.’ 
This new category is necessary when using the 
Cost- Value Technique because many extras to 
be evaluated do not fit under any of the exist- 
ing categories. 

The third major category given points in the 
Weighted Factors Selection Method was Expan- 
sion Potential. 

Expansion Potential 

The Cost-Value Technique looks at Expan- 
sion Potential in a new way, which is thought 
to be more meaningful, and evaluates the cate- 
gory by considering the value of the extra 
offered. 

To estimate the Expansion Potential of a 
system, it is necessary first to calculate the run- 
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ning time required by the system to complete 
all required applications. And, at this point, 
we are compelled to list the elements and 
aspects of a computer and its use that con- 
stitutes its running time. The items that must 
be considered in any calculation of the running 
time of a system are listed below : 

1. Speed 

a. Central Processor 

b. Peripheral Equipment 

c. Auxiliary Equipment 

2. Capacity 

a. Central Processor 

b. Peripheral Equipment 

3. Special Features 

a. Parallel Processing 

b. Simultaneous Operations 

c. Other 

4. Reliability 

a. Switchability 

b. Error Detection and Correction Fea- 
tures 

5. Preparation Time 

a. Set-up/Take-down 

b. Program Insertion 

c. Media Handling 

6. Non-productive Time 

a. Reruns 

b. Program Checkout 

7. Software Efficiency 

a. Compiled Languages 

b. Assembled Languages 

Central processor or peripheral equipment 
speeds are not new approaches to establishing 
the comparative merits of competitive equip- 
ment systems. What may or may not be new, 
but which nevertheless must be considered, is 
the interaction between the central processor 
and the peripheral equipment. If the time re- 
quired to do the processing called for in the 
run is great enough, it might cause delay in 
the input and output of the data. Thus, what 
must be determined is the effective speed of 
the peripheral equipment when operated in 
conjunction with the central processor. Also, 
the effect of auxiliary equipment on the system 
must be ascertained to assure its adequacy to 
meet the requirements of the input device. 


The capacity of the central processor is im- 
portant because it determines the number of 
program steps, plus data that can be accom- 
modated at one time. Normally, the larger 
the capacity the faster the system because 
fewer load steps are necessary and because 
sorting and merging functions can process 
larger volumes of data per pass. The capacity 
of the peripheral equipment is also important. 
The printer is a good example of what is meant 
here. A printer having 160-print positions 
may permit ‘two-up’ printing whereas a 120- 
print position printer may permit only ‘one-up’ 
printing, thus doubling its printing time. 
Again, a printer capable of printing eight 
copies may be able to do in one printing run 
what it would take some ‘six copy’ printers two 
runs to do. In both of these cases, the capacity 
of the specific peripheral equipment would ma- 
terially affect the total running time of the 
applications. 

The special features offered will usually 
affect the total running time of the system. 
Obvious examples of this are parallel process- 
ing, buffering, and simultaneous operations. 
The use of these features usually requires some 
investigation, but their applicability to calcu- 
lating run time is immediately apparent. Other 
special features should be examined to deter- 
mine if their value is to decrease total rup 
time. When such is the case, their influence on 
run time should be calculated and used. 

The use of the above three items, speed, 
capacity, and special features, are all rela- 
tively straightforward in determining total run 
time. Now, however, we must determine the 
influence that the systems’ reliability exerts 
on its total run time. How is this influence to 
be determined? This area of determination is 
admittedly an area of ‘guestimation’ rather 
than definable fact. However, it is felt that a 
better ‘guestimation’ can be made here about 
its influence on total running time than can 
be made in the absolute on its proper weighted 
scoring influence on the total selection. The 
area of concern here is system reliability. This 
is made up of a number of sub-items : reliability 
of individual units, number of units available, 
switchability of these individual units, the 
availability of error detection and correction 
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techniques, etc. The intent of considering the 
above items is to determine the number of 
hours of downtime per month that could be 
expected for each of the systems proposed and 
the frequency of the errors resulting from the 
systems unreliability. Historical information, 
if available, should be used for ‘ball park' fig- 
ures. From these figures, educated guesses can 
be made of the time required to correct the 
processing effect of these errors and the num- 
ber of hours of system downtime during which 
the system is not available. The withdrawal 
of these two figures from the total time avail- 
able of the system gives a new systems time 
available figure. 

The preparation time factors must be deter- 
mined and added to the processing times of each 
run for each of the systems proposed. The 
set-up and take-down times are concerned with 
the time required to prepare the peripheral 
device for a run and to take-down completed 
work after a run. Program insertion time is 
the estimated time required to insert the pro- 
gram into the system’s memory. Media han- 
dling time is the time required to change any 
media (magnetic tape reels, new printer paper, 
etc.) that must be changed while the run is in 
process. 

The non-productive time for an application 
must be estimated and its time added to the 
total system time. Non-productive time in- 
cludes rerun time due to operator or pro- 
grammer error 1 and program checkout time. 
The magnitude of this factor (usually assigned 
as a percentage of the productive time) is 
dependent upon the experience of users’ per- 
sonnel and an estimation of the number of new 
programs to be checked out each month. This 
factor could change (should decrease as the 
system gets older) from year to year. 

Another big timing factor, upon which little 
time has been spent to date, is software effi- 
ciency. The efficiency of the software is a very 
important factor since low efliciency may in- 
crease the running time by 25% or more. This 
factor is still in the ‘questimaticn’ stage, but 
with work reliable efficiency factors could prob- 
ably be determined for all the systems proposed. 

Now that all the items necessary for the de- 
termination of total running time, except the 


application, have been discussed, we will see 
how this running time is used in determination 
of the expansion potential. 

The determination of expansion potential 
cost-value is made by determining the value of 
an extra ‘time-block’ (increment of time) and 
deducting from this value the cost of obtaining 
the extra ‘time-block.’ The cost-value of suc- 
ceeding time-blocks should be determined until 
a zero value is obtained. 

The use of time-blocks should probably be 
explained. Time-blocks are used because addi- 
tional time on a computer system is recognized 
to have a decreasing value as more of it be- 
comes available. Thus, the first time-block is 
of more value than the second or any succeed- 
ing time-block. This is because the first time- 
block is obviously much more likely to be used 
than any succeeding time-block; indeed, this 
probability of use is one of the ways of assign- 
ing a cost-value to a specific time-block. But, 
why use time-blocks rather than hourly incre- 
ments or percentage expansion figures? Any 
of the three could be used. Time-blocks were 
used because of their convenience. With hourly 
increments, a great deal of computation is in- 
volved. Percentage expansion figures, while 
likely to be more accurate, involve consider- 
ably more work. The extra work required by 
hourly or percentage approach is not war- 
ranted, since the total result cannot be more 
accurate than value assignments given and the 
value assignments regardless of the time 
method used, are still just estimates. Which- 
ever method is used, the important thing is the 
decreasing value assigned to later time units. 

The determination of this cost-value of ex- 
pansion potential should only be done once. 
The expansion considered is the expansion po- 
tential in the system after its last expansion 
phase (last year of stated expansion) has been 
met. At this point, the future expansion poten- 
tial of the systems become the ‘extra’ being 
measured between systems. 

Let us take an example with two variations 
to illustrate the concept explained above. As- 
sume that two systems in the $20,000 a month 
class are proposed to handle the applications 
specified. The running times for the two sys- 
tems, when extended to include all the items 
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necessary, are found to be 150 hours a month 
for system A, and 200 hours for system B. 
Further, system A requires 60 hours a month 
of maintenance and system B requires only 40 
hours a month. 

1. If the application is one in which there 
is little likelihood of having additional 
expansion, then time-blocks may be made 
rather long and their per hour value 
rather small — say 100 hours per time- 
block and $50 per hour for the first block, 
$25 per hour for the next time-block, and 
$10 per hour for each time-block there- 
after. Thus, system A would have a value 
of (assuming 530 hours a month maxi- 
mum for both systems) $8700 and sys- 
tem B would have a value of $8200. From 
these value figures would have to be sub- 
tracted the cost of the extra shift machine 
rental, the cost of the personnel required 
to operate the system, the cost of any 
extra equipment necessary, and the costs 
of the extra system maintenance required. 
If there was any value remaining, it 
would be called the cost-value and cred- 
ited once to the system. 

2. If the application is likely to grow (or 
if there is the possibility of sub-leasing 
extra time to a second party), then time- 
blocks may be made relatively small and 
their per hour value rather high — say 
20 hour blocks and $150 or more per hour 
for the first time-block, with each suc- 
ceeding block decreased by $5.00 per 
hour. Again, the cost-value for the two 
systems would be computed by adding up 
the value and subtracting the cost to ob- 
tain these hours. With a large valuation 
placed on a time-block, the costs of going 
to higher speed or to larger numbers of 
peripheral equipment might be justified, 
if so, the cost of the higher speed units 
or extra units would also have to be sub- 
tracted. The cost-value remaining would 
be applied one time as a credit. 

The last major category given points in the 
Weighted Factors Selection Method was System 
Support. 

System Support 

The Cost-Value Technique considers the 


value of the extra offered. There are several 
methods of assigning cost-values to the System 
Support items listed in Appendix I. 

The simplest and perhaps the best method 
of cost-value assignment is to simply request 
the other vendors to quote costs to supply a 
service equal to what is considered ‘best’ in 
each case. Thus, if one vendor offers 24 hours 
on-site maintenance, and the other vendors 
don’t, it might prove very meaningful to ask 
the other vendors what the extra charge would 
be (and then credit the largest cost to the ven- 
dor already suppl 5 dng the item, and the largest 
cost minus each specific vendor’s cost for the 
item would be credited to each of the remain- 
ing vendors). The same t 5 q)e thing could be 
done for program assistance, personnel loaned, 
training, program testing, documentation, and 
special software. However, sometimes the 
costs quoted would be so excessive that it would 
not make a fair base against which to award 
value. For instance, if a user was impressed 
by some special programming routine, he might 
well ask the various vendors for the cost of 
suppljdng such a routine. But, he might re- 
ceive answers of tens and hundreds of thou- 
sands of dollars, where if he himself were to 
go out and procure such a routine he would not 
be willing to pay over five thousand dollars. In 
such a case, the five thousand dollars should 
become the base. Thus, in cases where the user 
places a value on a service, lower than a ven- 
dor’s cost, this value figure becomes the base 
for determining the item’s value. In some cases, 
however, i.e., maintenance offered, or documen- 
tation, the vendor may not be able to give cost 
figures for supplying service equal to vendors 
because he just doesn’t have the facilities nec- 
essary to give equal service. In such a case, 
the cost-value of such a service must be indi- 
vidually determined and might be considerably 
higher than the costs charged by any other 
vendor. In such a case, this higher cost-value 
figure should become the base. 

These system support items might also be 
handled by making it a mandatory requirement 
that specific amounts of these items be supplied. 
(This method is not recommended since a small 
vendor may not be in a position to meet some 
of these requirements, and thus he would be 
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eliminated for items that should properly be 
treated as ‘extras’ or at most specific cost 
items.) 

The cost-value of these items might also be 
ascertained by the user, by taking each item 
in turn and truly determining its value to him. 
If the vendor agrees to loan three programmers 
for three months, what is this really worth? 
He will get nine man months of work by an 
experienced programmer which might be the 
equivalent of fifteen months of programming 
by his own people (or a new person added), 
but it may also represent a very important 
time saving (ready to receive the system 
sooner) . Here are a couple of cost-value items, 
the cost-value most closely representing the 
users needs should be chosen. However, the 
cost-value should never exceed the cost of hav- 
ing the service contracted by someone else, thus, 
just because a helping hand (personnel loaned) 
by the vendor will result in the user being 
ready for the system installation three months 
earlier and thus result in a $100,000 a month 
savings for those three months, this doesn’t 
make the value of the personnel loaned $300,- 
000, because, people of the same caliber might 
have been hired from a software consulting 
service for $40,000. This $40,000 would then 
be the cost-value assigned. 

Some items like back-up available and debug- 
ging facilities are support items on which the 
vendors cannot be asked to change or improve, 
therefore, their cost-value has to be evaluated 
as the items are proposed. An approach to de- 
termining the cost-value of back-up would be 
to determine the probability of experiencing 
a catastrophic failure, and then determin- 
ing the cost associated with carrying on the 
computer activities on the back-up facili- 
ties available. The costs times the probability 
of its happening should give the probable cost 
for the various systems. Again, cost-value 
credits can be made out of these costs figures, 
by simply taking the highest cost minus a spe- 
cific vendor’s costs to determine his credits. 
Cost-value determination for debugging facili- 
ties could be handled in much the same way. 

Some items previously covered in other 
categories in the Weighted Factors Selection 
Method have not yet been discussed in the Cost- 


Value Technique. These items are covered 
under the next category. 

Other Extras 

There are many other extras that might be 
offered by a vendor. Items like desirable com- 
patibility or memory lockout can be handled by 
determining the costs that will be eliminated 
by the inclusion of such abilities. Thus, the 
costs that would have to be paid to convert 
tapes of one sort to another would be saved if 
the two systems were compatible; this cost 
therefore becomes the cost-value of such com- 
patibility, or the costs of the time and trouble 
that could be saved by the inclusion of a mem- 
ory lockout device becomes its cost-value. 

An important extra that will frequently be 
found in proposals deals with timeliness. A 
system proposed may claim to be able to allow 
management to have access to any information 
within the file in less than one minute, or to 
have management reports ready by 1:00 p.m. 
every day, or etc. In these cases, a study must 
be initiated to determine the cost-value to man- 
agement of being able to have one minute ac- 
cess, rather than 10 minute access as proposed 
for other systems; or of having the reports 
ready by 1:00 p.m. rather than 3:00 p.m. as 
proposed for the other systems, etc. Timeliness 
cost-values will usually have to be established 
by management (the group to whom the ‘time 
is money’ statement has the most meaning), 
but management should also be required to sub- 
stantiate their cost-value assignment by show- 
ing the saving or advantages that will result. 

Another type of extra, and an extra that is 
perhaps worth all the trouble necessary in 
making the system study, is the possibility of 
a new innovation or systems approach. The 
cost-value to be assigned would be equal to a 
realistic determination of the saving that would 
be accrued by using the suggested approach. 

Summary of Proposed Cost-Value Technique 

We have been briefly exposed to some tech- 
niques for determining the cost-value of a num- 
ber of items that should be included in any 
selection. The cost-values derived for the vari- 
ous vendors are applied as credits to offset 
the costs of the system and services he proposed. 
The vendor having the smallest difference (out- 
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of -pocket cost minus credits), is the vendor to 
whom the contract should be awarded. 

The Cost-Value Technique of computer selec- 
tion does not do away with all the problems 
found in any other type of objective evaluation 
and selection technique. All it tries to do is 
make the difficulties a little easier to overcome, 
the values assigned a little more realistic, and 
the resulting selection a little more meaningful 
and understandable. It does this through two 
important principles : 

1. First, in an effort to keep the selection 
simple and straightforward, the Cost- 
Value Technique recognizes only two 
categories of factors : 

a. Costs; which are a function of the 
equipment costs multiplied by the 
time required to complete the appli- 
cation, and many other cost items, 
and 

b. Extras ; which are any items of value 
that are inherent in the costs of one 
system proposed, but not to all sys- 
tems proposed, and do not directly 
influence the system's running 
time. Thus, extra maintenance 
service or extra expansion capabili- 
ties which are procured as part of 
the basic system are extras, but 
equipment characteristics like speed 
of central processor and peripheral 
equipment, or memory size, are not 
extras since they directly influence 
the running time which in turn di- 
rectly influences the cost of the 
system. 

Thus, an item is only evaluated once, 
either by its influence on cost — directly 
or via the running time of system — which 
in turn is a cost factor, or by its value 
as an extra. 

2. Second, by using dollar cost or value as 
the basis for scoring all the ‘extras' 
offered, a common denominator is ob- 
tained by which values can be unc^erstood, 
discussed, and changed independently of 
each other. Also, the ‘extras' scores can 
be directly related to the ‘costs' scores. 

I feel the Cost-Value Technique is a big im- 
provement over existing objective evaluation 


techniques, but it too can evolve. The next 
section briefly mentions some of the further 
improvements that might be made in Cost- 
Value Techniques. 

POSSIBLE ADDITIONAL IMPROVEMENTS 

A number of improvements might be made 
to the Cost- Value Technique. The use of debits, 
as well as of credits, could be used immediately 
and might make the technique a little more 
natural. But, two other improvements, time 
dependent cost-value assignments and quality 
determination, require more work with and a 
more thorough understanding of the Cost- Value 
Technique before their importance can be fully 
measured. 

Use of Debits 

The Cost- Value Technique could be improved 
by the use of debits in addition to credits. With 
‘debits,’ some items such as penalties for fail- 
ure to meet certain requirements (e.g., late 
submission of proposals, late delivery of equip- 
ment, etc.), could be more easily handled. Also, 
in the present technique certain desirable items, 
if present, are awarded credits; however, it 
might be more meaningful if their absence 
from a proposal and/or system was at the risk 
of debits. Thus, credits w^ould be used to re- 
ward systems really proposing some valuable 
‘extras,’ while proposals lacking certain fea- 
tures considered valuable in the original sys- 
tem specifications would be negatively recog- 
nized by the assignment of debits. 

Time Dependent Cost-Value Assignment 

A matter of concern in the present Cost- 
Value Technique is the fact that the costs to be 
incurred in the future (rental of equipment 
four years hence) have as much weight and 
value as the same categories of costs which are 
to be incurred in the present or the immediate 
future. Thus, I may be relatively certain that 
I will be paying $12,000 a month in rental this 
year, but I am not too certain that I will be 
paying $12,000 a month five years from now 
because the system requirements may have 
changed or I may have a much larger or a much 
smaller system at that time. Perhaps what is 
needed to equate such consideration is a time 
dependent, cost-value assignment. This assign- 
ment might be produced by multiplying the 
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cost of the item by the probability that the same 
cost will ever be paid. This approach would 
tend to make one-time costs more significant 
in the selection and minimize the amount of 
influence which ‘expansion potential’ tends to 
exert on the selection. 

Quality Determination 

The Cost-Value Technique, as discussed, does 
not provide a value assignment for the quality 
of supplied documentation or of the instruction 
to be given, etc. These quality items are of im- 
portance and eventually must be incorporated 
into the technique. However, much data will 
have to be collected on such items as these to 
build a body of experience, to create the limits 
of their importance, and to relate their im- 
portance with other elements of *the system. 

There are undoubtedly other modifications 
that could be made to the Cost- Value Technique. 
But, changes are expected and needed if a tech- 
nique is to grow. However, whatever changes 
might be made, they should not affect the basis 
for the Cost-Value Technique, which is the use 
of costs as the common denominator when re- 
lating the various items that must be consid- 
ered when evaluating computer proposals. This 
principle of using costs as the common denomi- 
nator when valuating items to be included in 
an evaluation technique is the heart of the 
Cost-Value Technique, and it is likeily to be a 
basic principle that will be incorporated into 
most of the new objective evaluation techniques 
that will evolve over the next 20 or so years. 

SUMMARY 

The Cost-Value Technique proposes two 
major changes to existing evaluation tech- 
niques. 

The first thange is in methodology. The Cost- 
Value Technique attempts to consider all items 
of value to a computer system, but to consider 
them only once and in the environment in which 
they belong. The categories scored are total 
system cost and ‘extras’ which are defined as 
features like expansion potential, vendor sup- 
port, or similar characteristics which are part 
of total system cost, but differentiating between 
vendors. 


The second change is in scoring technique. 
The Cost-Value Technique uses dollars rather 
than weighted points as the basis of compari- 
son. This provides a more natural basis for 
comparison. It eliminates the need for ‘trade- 
offs’ and gives management d^per understand- 
ing of the total selection process. 

These two changes are intended to bring 
about a more understandable and realistic 
selection. 

The Cost-Value Technique is intended to be a 
dynamic technique to which additional modifi- 
cations might be made. Some possible future 
improvements that might be incorporated into 
the Cost-Value Technique have also been shovpn. 

APPENDIX I 

Items Considered in Sample 
Weighted Factors Evaluation Method 

COSTS 

One-Time Costs 

Site Preparation 
Electrical 

Air Conditioning (Cooling, Heating, 
and Humidity Control) 

Power Supply (including all wiring) 
Construction 

Facilities (Space, walls, ceiling, paint- 
ing, draperies) 

False Flooring (including bracings) 
Equipment Installation 
Equipment Transportation (including insur- 
ance cost) 

Vendor Support 
Personnel 
Analysts 
Programmers 
Operators 
Instructors 

Training (including cost of transportation 
and living costs, if training not provided 
on-site) 

Existing Programs 
Back-up Facilities 
Machine Time (checkout) 

Documentation 

Program and Data Conversion 
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One-Time or Contimdng Costs (Dependent 
upon procurement method used) 

Central Processor and Associated Equipment 
Central Processor 
Console 

Floating Point Option 
Multiply Option 
Real Time Option 
Memory Units 
Etc. 

Peripheral Computer Equipment — On-Line 
or Oflf-Line 

Remote Inquiry Device 
Card Reader 
Card Punch 
Printer 

Magnetic Tape Units 

Immediate Access Storage (IAS) Units 

Paper Tape Reader 

Paper Tape Punch 

Controllers and Buffers 

Micr, Optical Scanner, etc. 

Auxiliary Equipment 

Key Punch Machines and other data 
Creation Devices (flexiwriter, teletype ma- 
chine, etc.) 

Continuing Costs 

Operation and Maintenance of all Electrical 
Equipment (Above Computer System Equip- 
ment, plus Air Conditioning, etc.) 

Personnel 

Managers 

Analysts 

Programmers 

Operators 

Others (key punch operators, etc.) 

Program Development 
Supplies 

Magnetic Tape 
Cartridges for IAS 
Printer Paper 
Cards 

Programming Forms 
Etc. 

Indirect Cost-Space Used 

EQUIPMENT CHARACTERISTICS 
Speed 

Time Required to Complete Applications 
Specified 


Instructions 

Add time (fixed and floating) 

Multiply time (fixed and floating) 

Divide time (fixed and floating) 

Move 

Etc. (through all other instructions though 
significant) 

Peripheral Equipment 
Printer (lines per minute) 

Card Reader (card per minute) 

Card Punch (card per minute) 

Magnetic Tape Units (characters per sec- 
ond) 

IAS (characters per second average) 

Etc. (through all other peripheral equip- 
ment listed) 

Capacity 

Characters of Storage in Main Memory 
(core) 

Characters of Storage in IAS 
Characters of Storage on Magnetic Tape 
Length of printed line, in characters 
Card size 

Length of Paper Tape 
Etc. 

Compatibility 

Program 

Tapes 

Cards 

Switchahility 
Magnetic Tape Upits 
Printers 
Other 

Reliability 

Error Detection 
Parity Checks 
Validation Checks 
Accuracy Checks 
Error Correction Techniques 
Near-Time-to-Failure (etc.) 

Special Features 
Memory Lock-out 
Parallel Processing 

Problem Timings — Sample Problems 
Central Processor Limited 
Input/Output Limited 
Balanced 
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Other Characteristics 

Size of Equipment (each piece considered) 
Weight of Equipment (each piece consid- 
ered) 

EXPANSION POTENTIAL 

Slack Time (Amount of available free time on 
each piece of system equipment) 

Central Processor 
Magnetic Tapes 
IAS 

Card Punch 
Printer 

Etc. (Through all other system equipment 
offered) 

Maximum Expansion (Number of units that 
can be added to system) 

Magnetic Tapes 
IAS 

Card Punch 
Printer 

Etc. (Through all other system equipment 
offered) 

SYSTEM SUPPORT 

Program Assistance 
Development 
Writing 
Converting 

Training 

Analysts 

Programmers 

Operators 

Maintenance Offered 

Backup Availability 

Program Testing 
Hours 
Location 

Existing Software 
Sort 
Merge 
COBOL 
FORTRAN 
Report Generator 
Etc. 


Documentation 

Personnel Loaned 
Analysts 
Programmers 
Operators 
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THE USE OF A COMPUTER TO EVALUATE COMPUTERS 


Donald J. Herman, President, and Fred C, Ihrer, Vice President and Technical Director 

COMRESS, hicorporated 
Washington, D, C, 


THE PROBLEM OF COMPUTER 
EVALUATION 

The complex problem of evaluating and 
selecting the optimum systems approach for the 
optimum computer to solve a particular data 
processing problem, has plagued management 
since the time that computers came into use for 
business and scientific data processing. 

The review, evaluation, analysis and selection 
of data processing hardware/software config- 
urations establishes the necessity for relating 
the functions of hardware performance char- 
acteristics and software program efficiency 
factors to the specifications of the proposed 
computer application. Each of these functions 
comprises interacting and interdependent activ- 
ities throughout the complete evaluation cycle, 
beginning with the delineation of a data proc- 
essing problem and ending with a successfully 
installed and efficiently operating computer. To 
accomplish this effort through a manual method 
of making comparisons and estimates of the 
relative proficiency of the various computers is 
a time-consuming and costly effort which does 
not always produce the ultimate solution. The 
requirement to establish proper relationships 
for evaluation purposes between each of these 
functions identifies the need for a management 
vehicle which will assist management in mak- 
ing a proper and correct computer evaluation 
decision. 

Various attempts have been made at the 
proper solution of this problem but none of 


these approaches has incorporated the maxi- 
mum utilization of the capabilities of a com- 
puter. A detailed analysis of the evaluation 
problem revealed that a series of techniques, 
incorporating a wide range of scientific disci- 
plines, could be utilized to approach an optimum 
solution. However, these techniques demanded 
a prodigious amount of calculation. To attempt 
to apply them to the problem without the aid 
of a computer would have been similar to solv- 
ing complex mathematical problems with the 
use of Roman numerals. 

A COMPUTERIZED SOLUTION TO THE 
PROBLEM 

COMRESS has prepared a software package 
called SCERT, meaning Systems and Com- 
puters Evaluation and Review Technique. 
SCERT is a simulation program which has been 
designed to accept the definitions of a data 
processing problem, and to build a mathe- 
matical model of each program run in the de- 
fined problem. Also, SCERT maintains a 
library of hardware and software performance 
factors for a wide range of digital computers. 
Using the algorithms which have been incorpo- 
rated into the program, it can extract the 
appropriate hardware and software factors for 
all the components in any one configuration. 
With this information, it will build a mathe- 
matical model representing the hardware/ soft- 
ware performance capabilities of each selected 
computer configuration. During the simulation 
phase then, SCERT simulates the response of 
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each of the '‘program models’" against the "per- 
formance model” of each of the selected con- 
figurations. 

The results of this simulation are represented 
in the form of several different management 
reports which furnish the user with projections 
of cost, time, memory and manpower require- 
ments which would be necessary to put his data 
processing system "on the air” for any one of 
the computers evaluated. 

SOME OF THE USES OF THE SCERT 
PROGRAM 

SCERT has been used during the past fifteen 
months by a large number of users representing 
a wide and diversified range of data processing 
problems and systems. These have varied from 
the typical business type sequential processing 
problem to the scientific problem and the real- 
time random access and communication prob- 
lems. 

Hardtvaj'e Selection 

For the data processing installation making 
its initial computer selection, SCERT provides 
management with an extremely valuable tool 
for assuring the selection of the proper com- 
puter hardware. SCERT can be used to evalu- 
ate the performance of an infinite number of 
computer configurations, thus assuring man- 
agement that they have selected the computer 
which most economically meets their processing 
requirements. 

Additionally, the SCERT projections will 
provide the new installation with many valu- 
able guidelines for the implementation of the 
system on the selected computer. SCERT will 
realistically project the programming man 
months involved and data media requirements 
and will establish accurate running time goals 
for the completed programs. Additionally, 
SCERT will aid in systems design by reflecting 
optimized tape block" size, channel assignments, 
internal running times and memory require- 
ments by function of the computer. 

Hardtvare Enhancement or Replacement 

The use of SCERT by an installation which 
is facing the problem of enhancing or replacing 
its present computer hardware can provide 
management with the optimum approach to 


this recurring problem. SCERT can be used to 
simulate a variety of potential hardware en- 
hancements such as faster tape stations, addi- 
tional memory, etc., and will accurately project 
the differences in program running time for 
each of the enhancements considered. Addi- 
tionally, SCERT can be used for considering 
the impact of replacing present hardware with 
larger or more modern hardware and will make 
realistic projections of utilization and cost of 
reprogramming for the various options con- 
sidered. It can also be used to determine the 
effect of running existing programs on com- 
patible machines in compatibility mode situ- 
ations. 

Application Analysis 

SCERT has been used by computer installa- 
tions to evaluate the impact on both their 
utilization and programming resources when 
considering the addition of new applications. 
In this environment it can be used to project 
the running time of the new application, the 
utilization of off-line and auxiliary equipment 
and the programming effort required. It is very 
frequently used to evaluate several different 
systems design approaches for the same prob- 
lem in terms of computer running time and 
programming effort. 

Installation Review 

SCERT has^ also been used by computer 
installation managers who are interested in 
evaluating the performance of their installation 
in terms of the standard projections made by 
SCERT. Since the SCERT projections are 
always based on an optimum use of computer 
hardware available, then these projections pro- 
vide highly realistic goals and standards by 
which actual program performance can be 
measured. This allows management to isolate 
those programs which are most subject to en- 
hancement or reprogramming. 

Hardware Design 

When used in the performance of this func- 
tion, SCERT provides the user with a valuable 
tool for assuring the capabilities of determining 
the correct design specifications of computer 
components, far in advance of committing re- 
search and development funds and valuable 
engineering time. By utilizing this program 
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during the early planning stages for new hard- 
ware, it is possible to evaluate the performance 
of a wide range of product specification vari- 
ations to determine the optimum computer char- 
acteristics required to meet the demands of a 
competitiye market. 

Generally, the procedure used for accom- 
plishing this function is one of defining a series 
of pre-selected and well defined systems appli- 
cations to the SCERT program. By then vary- 
ing the planned performance specifications of 
a “paper machine,” the user is able to determine 
the performance of various computer configura- 
tions, and select the one which best meets the 
desired market performance requirements. 

OPERATIONS OF THE SCERT PROGRAM 

SCERT is an extremely complex and fairly 
lar-ge program. The size of the program ap- 
proaches the magnitude of 31,000 instructions. 
It contains approximately 5,000 algorithms and 
maintains a library of approximately 100,000 
hardware and software factors. 

The program was originally written in a 
COMRESS specially designed language for an 
RCA 301 computer. As an operating software 
package on the RCA 301, it consists of 26 
phases and as such, the program would require 
from two to four hours of running time to 
simulate a problem consisting of 100 computer 
runs for the evaluation of six different com- 
puter configurations. The running time is a 
function of the size of the configurations being 
simulated, and the complexity of the defined 
programs. 

Our experience in defining runs to SCERT 
can best be measured as a range of magnitude, 
rather than an average or typical time. This 
range has varied from a minimum of ten to a 
high of fifty runs and their associated files 
defined in one man day of effort. A typical 
average elapsed time-frame for the complete 
production of a SCERT analysis is approxi- 
mately four weeks. The time required for 
definition is usually a function of the condition 
of documentation and/or the availability of the 
time of the systems analysts who designed the 
system to be simulated. 


COMPUTERS MAINTAINED IN SCERT 
LIBRARY 

At the beginning of 1964, the SCERT pro- 
gram included hardware and software factors 
on each of the following computer central 
processors and related peripheral devices. New 
computers are added to the library within one 
month of manufacturer's announcements. 

Manufacturer Computer 

Burroughs 205 

200 series 
5000 

Control Data Corporation 160A 

1604 

3200 

3600 

6600 

General Electric 215 

225 
235 
425 
435 

200 
400 

1400 
800 

1800 

1440 

1401 
1460 
1410 
7010 

7040 and 7044 
7070, 7072 and 7074 
7080 

7090 and 7094 
705 II and III 

304 
315 

2000 

301 
501 
3301 

SS 80 I and II 
1050 
U III 
1107 
490 


Honeywell 

IBM 


NCR 

Philco 

RCA 

UNIVAC 
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DESCRIPTION OF SGERT PROGRAM 

The SCERT program can best be described 
as the four phase program which is depicted on 
Figure 1. Phase I builds a mathematical model 
of the system and assures that it is a systems- 
oriented model. Phase II adjusts this model to 
the type of machine which is to be evaluated. 
Phase III simulates the performance of these 
models on each of the configurations being 
evaluated. Phase IV produces the necessary 
decision-making data on the several output 
reports of SCERT. 

PHASE I 

In this phase, SCERT creates a mathematical 
model of each computer run in the entire system 
application. The bases on which the model is 
built is the Systems Definitions, Environment 
Definitions and the File Definitions. 

The Systems Definitions (Figures 2 and 3) 
are basically a verbal portrayal of the systems 
chart reflecting the frequency and occurrence, 
the number and identity of I/O files, and the 
type of internal computer activity. 



Figure 1. Schematic of SCERT Program. 


Several classes of mnemonic codes are used 
to define internal computer activity; some of 
these are ; 

GeneralJob Codes 
Sort 
Merge 

Sequence Memory 

Data Processing Job Codes 

Update 

Extract 

Validate 

Mathematical Job Codes 
Square Root 
Sine 
Cosine 
Arc Tangent 
Etc. 

Matrix Job Codes 
Matrix Multiply 
Matrix Transpose 



Figure 2. Systems Definitions — Part 1. 
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Figure 3. Systems Definitions — Part 2. 


Matrix Invert 
Matrix Subtract 
Matrix Divide 
Matrix Add 

Basic Computer Activity Codes 
Data Move 
Data Compare 
Data Edit 
Data Translate 
Data Add 
Data Divide 
Data Multiply 
Data Subtract 

The File Definitions (Figure 4) reflect the 
basic parameters of each file. These specifica- 
tions include such data as the number of rec- 
ords, number of characters per record, the 
number of alpha and numeric fields, and the 
category and data media of the file. 

The Environmental Definitions (Figure 5) 
provide SCERT with the necessary factors or 



Figure 4. File Definitions. 


factor adjustments for all functions based on 
pure judgment. This input phase of SCERT is 
an open-ended design to allow for the individual 
requirements of each user. 

The types of data which are introduced by 
these definitions are: 

Software Environment 
Personnel Environment 
General Environment 

The Software Environment division provides 
the user with a method of specifying to SCERT 
the programming language, the Input/Output 
Control System, and the Program Monitor Sys- 
tem he will use. 

The Personnel Environment division defines 
for SCERT the experience level of the user’s 
programming staff. SCERT mathematically 
relates this to the complexity of the systems 
application and the programming difficulty of 
the language to be used for the configuration 
to be evaluated. 
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Figure 5, Environmental Definitions. 


The General Environment division allows the 
user to specify the length of time he plans to 
keep the computer, and such other factors as 
the percentage of the total application that the 
evaluated system definitions represent. 

PHASE II 

Phase II of the SCERT program accepts the 
Computer Complement Definitions (Figure 6) 
which define each of the various computer con- 
figurations to be evaluated. Based on these 
definitions, it extracts from its library of hard- 
ware and software factors, those required for 
each computer configuration. SCERT arrays 
these factors in the form of a three-dimensional 
matrix and then modifies the mathematical 
models of the runs in the system to conform to 
the hardware requirements of the particular 
computer. For example, the record sizes in- 
volved are, if necessary, made multiples of the 
word structure of the computer; and assign- 
ment of the input and output files is made to the 
appropriate peripheral devices available. 


Figure 6. Computer Complement Definitions. 

The Computer Complement Definitions which 
are input to this phase are simply an identifi- 
cation of the manufacturers’ model numbers, 
the quantity of each in the configuration, and 
the function code which specifies how the 
particular model will be used. The function 
code can specify that a unit will be operated 
on-line, as a satellite or in parallel. The defini- 
tions must also include the model number of the 
satellite computer, if any, as well as the model 
numbers of all peripheral and control devices 
associated with the satellite. 

During both Phases I and II, SCERT per- 
forms a complete validation of all input defini- 
tions. It checks for proper coding and for 
consistency of coding. The systems application 
is continually checked to insure its validity, 
especially in terms of the computer configura- 
tion being evaluated. For example, SCERT will 
determine if sufficient peripheral devices are 
available and are of the right kind to handle all 
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input and output functions. Errors discovered 
by SCERT in performing these validations are 
printed and the user has the option of continu- 
ing or restarting the program. 

PHASE III 

The basis for most of the decision theory 
techniques incorporated into SCERT is the 
simulation of the performance of each computer 
configuration for each computer run model to 
be evaluated. This simulation technique can 
best be described as the explosion of the com- 
puter run into its maximum number of thru-put 
iterations. SCERT contains over 5,000 algo- 
rithms which it uses to perform this simulation. 
Initially, all input, output and internal compu- 
tation times are calculated. As these timing 
forecasts are developed, a preliminary assign- 
ment of these times is made, based on the pow- 
ers and features of the central processing unit. 
Then, based on the simulation bf the thru-put 
iteration's, the simultaneous and net times are 
derived for each run. As a by-product of this 
simulation, SCERT computes the number of 
program steps, amount of memory used, number 
of tapes required for input and output files, etc. 
In addition, a pre and post run history is 
developed to forecast the utilization of periph- 
eral equipment such as key punch, off-line card 
reading, satellite computer time, and other re- 
quirements. 

The various terms in the algorithms are 
products of several sources. The system input 
definitions furnish a primary source of terms. 
Another set is provided by the environmental 
definitions. A third set of terms is derived by 
SCERT from historical and statistical tables, 
and another important source is furnished by 
the program’s library of hardware and soft- 
ware factors. Currently, this library contains 
fifty computer systems which are reflected by 
over 100,000 different hardware and software 
factors. 

PHASE IV 

This phase of SCERT provides the user with 
various levels of data developed by SCERT 
during the course of simulation and evaluation 
computations. The several output reports which 
are generated by SCERT during this phase have 


been designed to provide the user with a com- 
prehensive range of detailed and summary 
management information. These reports repre- 
sent a digest of several million computations 
and decisions made in a typical evaluation. The 
data represented on some of these reports is, 
to our knowledge, the first attempt ever to have 
been made to provide the managers of a data 
processing installation with data which can be 
used for management purposes. One of the 
examples of this type of data is shown on the 
SCERT Detail Systems Analysis report which 
provides the programmer with much of the in- 
formation which he would, under ordinary 
circumstances, have to develop through trial 
and error methods prior to the writing of his 
program. For instance, SCERT will optimize 
for each file in every program the record size, 
the block size, the input/output media assign- 
ment, and the I/O channel assignment. 

Computer Complement Report {Figure 7) 

This report is provided for each computer 
complement evaluated and serves mainly as an 
identification of the computer configuration. It 
reflects, by type of hardware device, the quan- 
ti^ of each model number in the configuration, 
the purchase price, minimum monthly rental, 
and the environmental requirem.ents of floor- 
ing, cooling, and the power requirements. 

Central Processor Utilization Report {Figure 
S) 

This report summarizes, by program run, the 
SCERT forecast for program running, set-up/ 
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Figure 8. Central Processor Utilization Report. 


Figure 10. Programming Requirements Report. 


take-down time, and memory utilization. It 
further provides information for those appli- 
cable computer systems on the parallel data 
conversion that could be scheduled during each 
program run. After listing these forecasts, 
SCERT then summarizes the projections for 
daily, weekly and monthly average utilization 
time, and finally, prints the projections for peak 
period utilization. 

Satellite Computer and Auxiliary Equipment 
Utilization Report {Figure 9) 

Based on the pre and post run history devel- 
oped during the SCERT simulation, this^ report 
forecasts for each computer run in the system, 
the satellite computer and auxiliary equipment 
utilization. Auxiliary equipment utilization is 
subdivided by function such as card to tape, 
tape to print, and other off-line data conversion 
operations. Also provided is the number of 
hours of data preparation work, such as key 


punch time, 

required to support each run. 
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Figure 9. Satellite Computer and Auxiliary Equipment 
Utilization Report. 


Programming Requirements Report {Figure 
10 ) 

This report, which is also a recap by individ- 
ual program run, projects the number of pro- 
gram steps required to put the program “on 
the air,'' and then makes an estimate of the 
user's programming effort in man months. It 
further reflects the number of program steps 
saved by using applicable sub-routines and 
utility programs furnished by the manufac- 
turers. The programming effort in man months 
projection is a function of the number of steps 
in the program related to the programming 
language specified by the user in the Environ- 
ment Definitions. This computation is made in 
relation to the experience of the user's pro- 
gramming staff correlated to the programming 
difficulty of the computer involved, and the 
nature of the individual program run. 

Application Summary Report {Figure 11) 

This report is a summation of all program 
runs within each application area and is pro- 
vided to assist the user in determining the 
priority to be assigned to the various applica- 
tions for implementation purposes. It shows 
the monthly system utilization by application 
together with a projection of programming 
effort and associated cost. 

Cost Summary Report {Figure 12) 

This report is provided for each computer 
configuration evaluated. It is a management 
digest of the costs associated with installing a 
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Figure 12. Cost Summary Report. 

computer and operating it once all applications 
are programmed. Recurring costs show the 
monthly rental computed by SCERT, based on 
the projections of utilization. If manufacturers 
have several rental options, SCERT will have 
selected the optimum. Shown for comparison 
purposes are the purchase costs which include 
the cost of maintenance. In addition to the 
equipment costs, the recurrent cost analysis 
also reflects personnel salaries for equipment 
operation and data preparation and program 
maintenance. 


analysis of internal computation, including 
time, memory, and program steps by function ; 
i.e., arithmetic computations, decision and con- 
trol, data handling and input/output control. 
Part III, a Special Time Analysis, forecasts the 
time required for program insertion time, pro- 
gram delay time due to errors, end of job 
rewind time, and other special functions. 
Finally, all timing forecasts are totaled to a 
net program running time. This report, in ad- 
dition to providing detailed documentation and 
back-up for all other SCERT reports, can be 
invaluable to an analyst or programmer when 
designing the running program. It will solve 
such programming problems as input/output 
channel assignments, and the determination of 
optimum record size, blocking factor and pro- 
gram segment size. 

COMPUTER COMPLEMENT REPORT 


The One-Time Cost Analysis shows the costs 
of acquiring data media such as magnetic tapes. 

Detailed Systems Analysis {Figure IS) 

This is an optional three-part report which 
provides a comprehensive analysis of each com- 
puter run as simulated by SCERT. Part I 
reflects for each input and output file the total 
buffered and unbuffered time, the input/ output 
device and channel assignment selected by 
SCERT, the number of reels, if magnetic tape 
file, and the optimum records per block devel- 
oped by SCERT. Part II of the report is an 


Report Explanation 

This report reflects descriptive information 
for each computer configuration to be evaluated. 

Column Explanation : 

1 MODEL NUMBER — Identifies each 
component in the configuration. 

2 MANUFACTURER— Identifies man- 
ufacturer of each component. 

3 QUANTITY IN SYSTEM— The com- 
ponent quantity in each configuration. 

4 PURCHASE PRICE— For each com- 
ponent and associated special features. 
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5 MINIMUM MONTHLY RENTAI^ 
The minimum rental option for each 
component. 

6 FLOORING SQ. FT.— Represents the 
minimum number of square feet of 
physical and work areas requiring 
false flooring for each component. 

7 COOLING BTU/HR.— Shows BTU^s 
required for cooling each component. 

8 POWER SUPPLY— Reflects power 
requirements for each component. 

9 CENTRAL PROCESSING UNIT— A 
subdivision of this report which shows 
all equipment connected on-line to the 
main frame. 

10 PERIPHERAL DEVICES— The name 
of each component shown for the con- 
figuration. 

11 SPECIAL FEATURES— Reflects all 
special features and control devices. 
All costs and other factors are shown 
with the appropriate device. 

12 SATELLITE COMPUTER— Reflects 
the satellite computer (if required) to 
support the central processor. 

13 TOTAL EDP SYSTEM— Reflects 
totals for all components for purchase 
price, monthly rental, floor require- 
ments, BTU^s and power in KW and 
KVA. 

CENTRAL PROCESSOR UTILIZATION 
REPORT 

Report Explanation 

This report reflects analytical information 
computed by SCERT on each program run in 
the entire system for each computer configura- 
tion evaluated. 

Column Explanation : 

1 PERIOD — Identifies the periodic oc- 
currence of the computer run. 

2 FREQUENCY— Identifies the fre- 
quency of occurrence within the pe- 
riod. 

3 RUN NUMBER — An identification 
number assigned by the analyst. 

4 RUNNING TIME— This is the pri- 
mary projection developed by SCERT 


simulation and represents elapsed 
running time of the program. It is 
based on optimum utilization of mem- 
ory, instruction power and simultane- 
ity. 

5 SET-UP — Reflects net amount of set- 
up and take-down time required. 

6 TOTAL TIME — A summation of run- 
ning time and set-up time. 

7 MEMORY CHARACTERS UTI- 
LIZED — Represents total require- 
ments for each program, including in- 
struction, work areas, I/O data, and 
software routines. An asterisk indi- 
cates memory has been exceeded, and 
SCERT has segmented. 

8 PARALLEL PROCESSING— When 
equipment has the capability of per- 
forming parallel processing; SCERT 
calculates the effective parallel con- 
version time available. This figure is 
computed within the parameters of 
memory, operational time and I/O 
channels available during each run. 

9 RUNS TOTAL — A summation of the 
SCERT projections of running time 
for all runs occurring in the period 
indicated. 

10 AVERAGE TIME — The time required 
for all runs plus a proration of the 
running time for runs occurring more 
or less frequently than the period indi- 
cated. 

11 NORMAL MONTH TOTAL— A sum- 
mation of the total time required for 
all computer runs occurring monthly 
or more frequently. 

12 PEAK MONTH TOTAL— The normal 
month time plus the time required if 
all quarterly, semiannual and annual 
runs were to occur during one month. 

SATELLITE COMPUTER AND AUXILIARY 
EQUIPMENT UTILIZATION REPORT 

Report Explanation 

This report reflects analytical information 
computed by SCERT on each program run in 
the entire system for each computer configura- 
tion evaluated. 
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Column Explanation: 

1 PROGRAM RUN NUMBER— An 

identification number assigned by the 
analyst. 

2 SATELLITE COMPUTER— A pro- 

jection of the hours of utilization of 
the satellite computer to support each 
program run. In this example, off-line 
equipment was used for normal data 
conversion operations, consequently, 
no utilization is reflected for a satellite 
computer. 

3 OFF-LINE EQUIPMENT UTILIZA- 
TION— During the SCERT simula- 
tion, a pre and post history is devel- 
oped for each computer run. SCERT 
analyzes this history to determine the 
off-line requirements for converting 
source document data to a media ac- 
ceptable by the configuration being 
evaluated. It also determines the post 
run requirements for such off-line 
functions as printing, punching, etc. 
This off-line utilization is reflected in 
terms of hours per average month by 
type of function for each run, such as 
tape to print and card to tape. 

4 DATA PREPARATION — Machine 
time requirements for all necessary 
data preparation such as key punching 
and paper tape punching for each pro- 
gram run are computed and reflected 
in terms of the hourly requirements 

Q vt/v-hq -rvx ■4-Vi T v\ 4-Vl1 Q OVOiTV^Tvlia 

pfcx dv excise iiivxxitii. xii i/xixo 

source documents were not defined, 
therefore, data preparation time was 
not computed. 

PROGRAMMING REQUIREMENTS 
REPORT 

Report Explanation 

This report reflects analytical information 
computed by SCERT on each program run in 
the entire system for each computer configura- 
tion evaluated. 

Column Explanation: 

1 RUN NUMBER — An identification 
number assigned by the analyst. 


2 PROGRAMMED BY USER— A pro- 
jection of the number of program steps 
which must be programmed by the 
user's staff to prepare each program. 

3 SUB-ROUTINES — A quantitative re- 
flection of the number of program 
steps which will be incorporated into 
each program run as a result of the 
utilization of manufacturer furnished 
sub-routines. 

4 UTILITY ROUTINES— A quantita- 
tive reflection of the number of pro- 
gram steps in the manufacturer fur- 
nished utility routines used in each 
run. 

5 PROGRAMMING EFFORT IN MAN 
MONTHS — A quantitative projection 
of the number of man months required 
to write each program run. This pro- 
jection, although it may be no closer 
than plus or minus 15% of actual pro- 
gram effort required, because of the 
technique and the factors used in ar- 
riving at it, represents the best pos- 
sible projection that can be achieved. 
The factors used for arriving at this 
projection are the number of program- 
mers and their experience levels, the 
complexity of the computer run, and 
the language to be used. 

APPLICATION SUMMARY REPORT 
Report Explanation 

This report reflects the summarization of 
information computed by SCERT at the pro- 
gram run level for each computer configuration 
evaluated. 

Column Explanation: 

1 APPLICATION CODE— The first 
character of program run number will 
be assigned by the analyst to identify 
each application in the system. 

2 PROGRAM TIME— A projection of 
the CPU utilization hours for each 
application for an average month. 

3 SET-UP TIME — Reflects net set-up 
and take-down time required. 

4 SATELLITE COMPUTER UTILIZA- 
TION — A projection of the utilization 
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hours in support of the central proces- 6 

sor for each application for an aver- 
age month. 

5 PROGRAMMING EFFORT IN MAN 
MONTHS — A summarization of pro- 
gramming manpower effort required 

for implementing all of the computer 7 

runs in each application. 

6 PROGRAMMING EFFORT DOL- 


LARS COST — An estimation of pro- 
gramming costs, based on an extension 
of the number of man months of effort 
multiplied by the average monthly 
salary of a programmer, as furnished 
by the user through the Environment 
Definitions. 

COST SUMMARY REPORT 
Repoi^t Explanation 

This report reflects the SCERT projections 
for all costs associated with the operation of 
the entire system for each computer configura- 
tion evaluated for an average month. 

Column Explanation : 

1 BASIC HOURS — Represents the 
number of hours in the rental option 
selected by SCERT. SCERT analyzes, 
in terms of the number of hours of 
component utilization, all options of- 
fered and then selects the optimum 
one. 

2 EXTRA USE — The number of hours 
of extra use required beyond the basic 
hours provided in the option selected 
by SCERT. 

3 MONTHLY RENTAL — Represents 
the total basic hours plus extra use 
hours monthly rental for each compo- 
nent during an average month. 

4 PURCHASE PRICE/XX— The pur- 
chase price of each component amor- 
tized over an XX month period. The 
period of amortization is selected by 
the user and defined to SCERT 
through the Environment Definitions. 

5 MONTHLY MAINTENANCE— Rep- 
resents the maintenance charges for 
each purchased component during an 
average month. 


MONTHLY COST— A total of* the 
amortized purchase price plus the 
monthly maintenance cost. This is a 
figure which can be compared to the 
monthly rental to assist in determin- 
ing purchase versus rental. 

DATA PREPARATION— A dollar 
figure representing the personnel costs 
required to prepare input data on a 
recurring basis to achieve the opera- 
tion of the total system. This projec- 
tion is based on factors furnished 
through the Environment Definitions, 
such as number of data preparation 
personnel and their mean salary, the 
user’s estimate of their data prepara- 
tion production, i.e., key strokes per 
hour; and the computations made by 
SCERT which reflect the total data 
preparation requirement for an aver- 
age month. 

8 EQUIPMENT OPERATION— A dol- 
lar figure representing the projection 
of equipment operation costs for an 
average month. This estimate is com- 
puted on data defined to SCERT, such 
as numbers and mean salaries of 
operation personnel ; and the computa- 
tions made by SCERT to determine 
the hours required for processing the 
entire system on the simulated config- 
uration. 

9 PROGRAM MAINTENANCE — An 
estimate of the user’s personnjel costs 
to maintain all computer runs in this 
system. 

10 DATA MEDIA COSTS— Represents 

the costs computed by SCERT for un- 
expendable data media necessary for 
the implementation of this system. 
Examples are magnetic tapes or any 
other type of interchangeable data 
storage media. 

DETAILED SYSTEMS ANALYSIS REPORT 
Report Explanation 

This report reflects detailed analytical infor- 
mation computed by SCERT on each program 
run in the entire system for each computer 
configuration evaluated. 
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Column E xplanation : 

1 PART I, INPUT/OUTPUT ANAL- 
YSIS — This part of the report is a 
complete analysis of all I/O functions 
in the system. 

2 INPUT/OUTPUT FILES— A column 
showing file name. 

3 FILE NUMBER — An identification 
number assigned by the analyst. 

4 NUMBER OF RECORDS— The num- 
ber of records for each of the files. 

5 RECORD SIZE — The optimized rec- 
ord size computed by SCERT for each 
file. 

6 RECORDS PER BLOCK— The opti- 
mum file block size computed by 
SCERT, 

7 NUMBER OF REELS — The number 
of reels required for each tape file. 

8 INPUT/OUTPUT MEDIA— A code 
specifying the type of media assigned 
to each I/O file. If the computer is a 
multi-channel machine, SCERT deter- 
mines the channel assignment for each 
device and indicates it. 

9 BUFFERED TIME — An estimate of 
that I/O time which can be buffered 
(or overlapped) by other operations 
during each program run. 

10 UNBUFFERED TIME— A projection 
of that I/O time which cannot be over- 
lapped and is therefore net time dur- 
ing each program run. 

11 PART II, INTERNAL COMPUTA- 
TION ANALYSIS— During the sim- 
ulation of each program run, SCERT 
compiles by functional category the 
number of program steps, associated 
memory utilized by work areas, and 
required time for all internal compu- 
tation. 

12 FUNCTION — Based on the internal 
computer activity codes specified by 
the analyst, SCERT determines a 
distributional breakout for the four 
functions of Arithmetic Computation, 
I/O Control, Data Handling, and De- 
cision and Control. 


13 PROGRAM STEPS— The number of 
instructions to be programmed by the 
user and to be furnished by manufac- 
turer’s software. 

14 MEMORY USED— A total of the 
memory utilized for a function, includ- 
ing requirements for program steps, 
work areas, tables and I/O areas. 

15 INTERNAL TIME— Time required 
for the computer to perform a func- 
tion. 

16 PART III, SPECIAL TIME ANAL- 
YSIS — An open ended analytical tech- 
nique incorporated into SCERT to 
highlight for the program analyst the 
time consumed by those functions 
which are meaningful to the design of 
the system. 

17 PROGRAM INSERTION TIME— A 
projection of the optimum time re- 
quired to insert the program into 
memory based on available peripheral 
hardware. If SCERT determines that 
segmentation is required, the added 
time for inserting the various pro- 
gram segments is also computed and 
shown in this column. 

18 PROGRAM DELAY TIME, MULTI- 
MEDIA CHANGES— SCERT deter- 
mines the number of reels required for 
tape files, and if during a program run 
an insufficient number of tape stations 
are available for each multi-reel file, 
it computes the time delay for rew’ind, 
take-down and set-up time for each 
reel. 

19 END OF JOB REWIND TIME— The 
time required to rewind the longest 
reel. 

20 SET-UP AND TAKE-DOWN TIME 
— The optimized time required to set 
up peripheral devices before each run 
and to take down completed work after 
the run. It is based on the number of 
operators and an analysis of the runs 
previous to and subsequent to each 
run. 




A GENERAL-PURPOSE TIME-SHARING SYSTEM 


Jtdes I. Schwartz, Edward G. Coffman, and Clark Weissman 
System Development Corporation 
Santa Monica, California 


INTRODUCTION 

Since June 1963, a Time-Sharing System has 
been operational at the System Development 
Corporation in Santa Monica. This system was 
produced under the sponsorship of ARPA and 
has utilized ideas developed at both Massachu- 
setts Institute of Technology®’ and Bolt, 
Beranek, and Newman,^’ “ as well as some origi- 
nal techniques. Time-sharing, in this case, 
means the simultaneous access to a computer 
by a large number of independent (and/or re- 
lated) users and programs. The system is also 
“general purpose,” since there is essentially no 
restriction on the kind of program that it can 
accommodate. The system has been used for 
compiling and debugging programs, conducting 
research, performing calculations, conducting 
games, and executing on-line programs using 
both algebraic and list-processing languages. 

This pai)er is divided into four major dis- 
cussions. These are : ( 1 ) an outline of the capa- 
bilities provided for the user by the equipment 
and program system; (2) a description of the 
system’s operation, with an analysis of the sys- 
tem scheduling techniques and properties; (3) 
a somewhat detailed description of two of the 
currently operating system service programs; 
and (4) a conclusion and summary. 

CAPABILITIES FOR THE USER 
Equipment Configuration 

The major computer used by the Time- 
Sharing System (TSS) Executive is the AN/ 
FSQ-32 (manufactured by IBM). Also used 


by the system is the PDP-1 (manufactured by 
Digital Equipment Corp.), which is the major 
input/output vehicle for the various remote 
devices. 

The remote input/output devices available to 
users include Teletypes, displays, and other 
computers. These devices can be run from 
within SDC, and from the outside, with the 
exception of displays, which can be operated 
only a short distance from the computer. It is 
expected that computers to be used at remote 
stations will eventually include the CDC 160A, 
the DEC PDP-1, and the IBM 1410. (Currently 
only the 160A is being used, from an installa- 
tion 400 miles distant from the Q-32.) Figure 1 
is a description of the system’s remote equip- 
ment configuration. 



Figure 1. Remote Equipment Configuration. 
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The AN/FSQ-32 computer is a I’s-comple- 
ment, 48-bit-word computer, with 65,536 words 
of high-speed (2.5 usee, cycle time minus over- 
lap) memory available for programs, and an 
additional 16,384 words of high-speed memory 
available for data and input/output buffering; 
the latter memory is called input memory. The 
PDP-1 also has access to the input memory; 
thus, this memory serves as the interface be- 
tween the two computers. In addition, the Q-32 
has an extremely powerful instruction reper- 
toire, including access to parts of words for 
loading, storing, and arithmetic; it also has an 
extensive interrupt system. 

Figure 2 shows the principal components of 
the system and the important information-flow 
paths throughout the system. As implied in 
the figure, each main memory bank (16K 
words) is individually and independently ac- 
cessible by three control units : the central proc- 
essor unit, the high-speed control unit, and the 
low-speed control unit. High-speed I/O, low- 
speed I/O, and central processing can take 
place simultaneously out of different memory 
banks, or, with certain restrictions, out of the 
same memory bank. The high-speed and low- 
speed I/O operations originate, of course, from 



Figure 2. TSS Hardware System, 


requests by the central processor unit. The 
low-speed control unit can service two or more 
low-speed I/O devices simultaneously, while the 
high-speed devices can only be operated indi- 
vidually, mainly because their cycle time ap- 
proaches that of core memory. 

A memory-protection mechanism and an in- 
terrupt, interval (quantum) clock (not shown 
in the figure) are also integral parts of the 
TSS computer system. On a bank-by-bank basis 
only, the memory protection mechanism pro- 
vides the capability for inhibiting, under pro- 
gram control, the writing of information into 
one or more memory banks. The quantum clock 
has the following characteristics : 

1. It can be set under program control to a 
time interval (quantum) anywhere in the 
range from a few msec, to 400 msec. 

2. It can be made to interrupt computer op- 
erations after the set interval has elapsed, 
or after any power-of-two multiple of the 
set interval (up to a multiple of eight) 
has elapsed. 

3. Under program control, it can be acti- 
vated and reset. 

A summary of the pertinent device charac- 
teristics is given in Table I below. The disk 
file shown in Figure 2 is currently being in- 
corporated into the system. 

The Time-Sharing System as it Looks to the 
User 

The time-sharing user today communicates 
with the Time-Sharing System primarily by 
means of Teletype. He has at his disposal six 
basic commands to the system. Briefly, these 
commands are : 

• LOGIN : The user is beginning a run. With 
this command he gives his identification 
and a “job number.” 

• LOAD : The user requests a program to be 
loaded (currently from tape, eventually, 
from disk). Once this command is exe- 
cuted, the program is an “object program” 
in the system. 

• GO: The user starts the operation of an 
object program or restarts the operation 
of an object program that has been 
stopped. Once the user gives this com- 
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Table I. 

Characteristics of the AN/FSQ-32 Storage Devices 

DEVICE 

SIZE 

WORD RATE 

AVERAGE ACCESS TIME 

Core Memory 

65K 

2.5 /isec./wd. 

— 

Inut/Output 

Core Memory 

16K 

2.5 /xsec./wd. 

— 

Magnetic Drums 

400K 

2.75 jusec. /wd. 

10 msec. 

Disk File 

4000K 

11.75 /Asec./wd. 

225 msec. 

Magnetic Tapes 

16 

Drives 

128 ftsec./wd. 
(High density) 

5 to 30 msec, (no positioning), depending on 
whether the tape is at load point, and whether 


it is being read or written. 


mand, he can send Teletype messages to 
either his object program or the Time- 
Sharing System. 

• STOP : The user stops the operation of an 
object program. 

• QUIT: The user has finished a particular 
job. Upon receipt of the QUIT, the Time- 
Sharing System punches a card with cer- 
tain accounting information on it and re- 
moves the object program from the sys- 
tem. 

• DIAL: The user may communicate with 
other users or the computer operators with 
this command. 

In addition to these basic commands, the user 
has available to him a variety of on-line pro- 
gram debugging, or checkout, functions which 
give immediate access to any part(s) of an 
object program. 

Briefly, these debugging functions include: 

• Open: Displays the contents of the given 
memory or machine register and uses this 
as a base address for other debugging 
commands. 

• Modify open register address : Changes the 
address of the opened register by the given 
increment or decrement. 

• Insert: Inserts the given value into the 
opened register. 

• Mask: Inserts values by the given mask. 

• Mode: Displays values according to speci- 
fied mode (floating, decimal, octal, Hol- 
lerith). 


• Break point: When a specified point in the 
program is reached, notifies the user, and 
(on options) displays registers, and stops 
or continues the program. As many as 
five break points are allowed simultane- 
ously. 

• Dump: Dumps a given set of registers, 
either on Teletype or tape. 

The actual commands to perform these func- 
tions usually include a ssnnbol or address with 
one or two unique Teletype characters. 

AdditioncU Facilities Available to System Users 

The commands and devices mentioned so far 
are facilities available to users or users' object 
programs directly through the Time-Sharing 
System’s Executive. With these facilities one 
could run and debug programs that exist in a 
binary form. To make the system more useful, 
however, a number of additional devices (called 
service routines) are available to users. These 
are themselves run as object programs, so it 
is clear that there is no limit to the number of 
service routines that can eventually be made 
available. 

These service routines include programs to 
file and update symbolic information ; com- 
pilers; a fancy desk calculator; tape-handling 
routines; and a number of others including 
some advanced routines utilizing interpretive 
techniques. A more detailed description of these 
interpretive routines appears in the time-shar- 
ing applications section, below. 
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SYSTEM OPERATION AND TIME AND 
SPACE ALLOCATION 

System Operation 

The discussion so far has been primarily on 
the operation of the system from the user's 
point of view. The following is an over-all de- 
scription of the system and how it operates. 

Basically, the system operates as follows: 
All object programs are stored on drum, put 
there as a result of the LOAD command. When 
a program's time to operate arrives, or, pref- 
erably, ahead of this time, it is brought into 
high-speed memory. If bringing a program into 
its area in memory causes a storage conflict 
with another program, the latter must be re- 
stored to its place on drums (a process called 
swapping). A program's turn will end when 
it initiates an input or output request, when 
a machine or program error is detected, or 
when its time is up, the time allotted being de- 
termined prior to its turn. At the completion of 
its turn, its machine environment (e.g., accu- 
mulator, index-registers, etc.) is saved, and it 
either resides in memory until its next turn 
or is written on drums. This mechanism is con- 
trolled by the time-sharing Executive. 

As stated before, there is no restriction on 
the type of object program that can run in the 
system. Therefore, as much input/output 
equipment as possible is made available to ob- 
ject programs; thus, object programs may use 
tapes, displays, and Teletypes for input and 
output. Other computers can also be treated 
as input/output devices; further, disk storage 
is available to object programs. Since it is im- 
practical, in such a system, to have specific 
Teletypes or tapes referred to by object pro- 
grams, input/output is done in a general fash- 
ion, with all input/output devices given arbi- 
trary names by the object programs and 
declared to be files used by the object program 
during its run. Thus, only the Time-Sharing 
System knows what physical tape drivers. Tele- 
types, or areas of drums are being used. 

The Time-Sharing System's Q-32 Executive 
occupies 16,384 words of memory, leaving the 
remainder of memory for object programs. The 
Executive that exists in the PDP-1 is primarily 
concerned with maintaining the flow of infor- 


mation to and from the remote devices. It 
does relatively little decision-making. However, 
it does determine the kind of input/output 
device concerned, the type of conversion neces- 
sary (if any), and the particular channel of the 
device with which it is communicating. 

The time-sharing Executive in the Q-32 has 
eight major components. These include routines 
that perform input/output, perform on-line 
debugging, interpret commands, assign stor- 
age, and schedule object programs. By far the 
most distinctive feature of the time-sharing 
Executive, compared to other monitors or ex- 
ecutive systems, is the scheduler. Accordingly, 
a more detailed description of time and space 
scheduling follows. 

Time Allocation and User Capacity 

The first problem considered in the Time- 
Sharing System (TSS) scheduling design was 
the determination of the minimum amount of 
time to be given each program during a re- 
sponse cycle of the system. A response cycle is 
that period of time during which all active pro- 
grams (i.e., programs requiring central process- 
ing time) are serviced. Clearly, to satisfy 
TSS objectives, this quantum of time (q) must 
be at least as great as the average amount of 
time required by an object program to produce 
a response. Here, of course, we refer only to 
those programs designed to communicate with 
a user station (display or keyboard device), 
and to those programs for which a fast response 
is desired and can reasonably be expected. In 
other words, a user requesting a matrix in- 
version will (and must) expect to wait con- 
siderably longer than a user wishing only to 
see the contents of some register in his pro- 
gram. 

Initially, it was obviously not possible to 
determine a priori the distribution of object- 
program operating times, nor was it even pos- 
sible to define or classify the group of users re- 
quiring these data. The currently available in- 
formation regarding user programs, and, to 
some extent, the experience of others, indicated 
that a q of 50 msec, was sufficient. The extensive 
recording now being performed during TSS 
operation is accumulating data that will much 
more accurately indicate the necessary q size. 
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In the following section, “worst-case” situa- 
tions are being treated. “Worst-case” situa- 
tions are being treated because they, by defini- 
tion, give the overload threshold or capacity of 
the system ; because they simplify the problem 
of having to cope with the distributions of 
object-program sizes and operating times ; and 
because TSS will be operating at, or near, 
capacity for a high percentage of the time, if 
the present rate of usage continues. In some 
cases the “worst-case” values that are used had 
to be estimated. There is considerable evidence, 
however, to support the estimates given in the 
following approximation of the maximum num- 
ber (Umax) of active users that can be serviced 
in one response cycle, when given the size of the 
response interval (tr), the quantum size, and 
the hardware constraints. 

In the current version of TSS the “worst- 
case” response cycle consists of the following 
recurrent, non-overlapping sequence of opera- 
tions: dumping of the last program operated; 
loading of the next program to operate ; alloca- 
tion of the time interval for operation. For the 
values of q and t, that are of interest, the num- 
ber of active programs in the system can be 
much larger than the number of memory-pro- 
tected programs that can be held in core 
memory at one time; therefore, the above se- 
quence will virtually always be necessary for 
the operation of each program. 

Assuming (as is presently the case) that 
object programs are not relocatable, we have 
(in view of the regular, cyclic operation of 
TSS) the following simple relation, 

tr(l — •>?) 

~ 2t»-fq ^ ^ 

where t, represents an average value for the 
time it takes to transfer a program from drum 
storage to core memory or vice versa, and 1 / is 
the fraction of time (overhead) used by the 
Executive during each response cycle. 

The fraction of overhead (y) is a difficult 
quantity to evaluate, and it depends to some 
extent on n„,,,x. Because of the complexity of 
TSS operation, it is also difficult to estimate 
r) through recording during TSS operation. 
From experience to date with the system, it is 
estimated that t ranges from two per cent to 


fifteen or twenty per cent depending on existing 
circumstances. 

Equation (1) shows that, without major re- 
visions in hardware, a significant inprovement 
in nmai can be achieved only through a decrease 
in the quantity (2t, -j- q). In particular, if 
object programs can be made dynamically re- 
locatable, this quantity can be reduced to the 
value of 2t, alone. Clearly, this is the best one 
can do, simply because the speed of the high- 
speed I/O section in swapping programs in an 
uninterrupted sequence represents a funda- 
mental upper bound on TSS capacity. Further 
improvement necessitates an extensive increase 
in core-memory size, so that at least some active 
programs can remain in memory during con- 
secutive response cycles. An increase in nm,x 
brought about by an increase in the speed of 
the high-speed I/O section is not economically 
feasible as can be seen from the equipment de- 
scription given earlier. 

Assuming dynamic relocatability of pro- 
grams, equation (1) changes to: 

n„u,s ^ ( 1 — T) ) tr/2t, ( 2 ) 

In practice, the extent to which the optimum 
is attained depends on the distributions of 
object-program sizes and operating times. If 
2ts is substantially larger than q. Equation (2) 
can, for all practical purposes, be considered an 
equality. Relocatability at load time would, of 
course, also significantly increase nmax, but the 
improvement that could be expected would be 
substantially less than that given in Equation 
(2). For a more specific evaluation of the im- 
provement, a knowledge of the distributions 
just mentioned is necessary. 

The linear relationships between nmai and t, 
given by Equations (1) and (2) are shown 
graphically in Figure 3 with the following 
parameter values : q = 50 msec., t/ = 0.20, and 



Figure 3. Response Time vs. Number of Users. 
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2ts = 100 msec, (corresponding to a program 
size of 16K words). At present, a value of 2.0 
sec. is being used for tr, with a resulting n^.^ 
of about 11. 

Up to now, the efficiency of central processor 
utilization (relative to unoverlapped I/O time) 
has been considered of secondary importance, 
providing that user requirements have been 
met. Admittedly, this computational efficiency® 
is rather low in the “worst-case” situations. 
As will be seen in the next section, however, the 
way in which object programs are sequenced 
tends to maximize this efficiency for any given 
load situation. Clearly, the installation of 
dynamic relocatability in the system would 
allow an efficiency up to 50% since q can be 
made equal to 2t, without affecting Equation 
( 2 ). 

It should be emphasized that nn,«K does not 
represent the maximum number (N„ax) of user 
stations that can be active at one time ; it rep- 
resents only the maximum number of user 
programs that can be serviced in a fixed re- 
sponse interval under the assumptions given 
earlier. It has been conservatively estimated 
that the associated object program is in need of 
central processor time only ten to twenty per 
cent of the time during which a user station is 
in use. Accordingly, it may be possible to 
make considerably larger than n^ax with- 
out significantly jeopardizing user-response 
requirements. Three important factors figure 
in the estimate of p =r nmax/Nmai : 

1. Relative to computer processing speeds, 
many applications (e.g., debugging, gam- 
ing) consume considerable user time in 
thinking and output analysis. 

2. The average user is less than professional 
in his use of input devices. A slow 
manual-input rate, coupled with occa- 
sional typing or format errors, will cer- 
tainly tend to make p small. 

3. Generally, computer output to user sta- 
tions takes as much as one to ten seconds. 

The estimate of p given above was based on 
the observation of these three factors during 
system operation and has been justified by the 
results of the limited amount of recording 
currently available. In obtaining the precise 


distribution of the quantity p it will be possible 
to determine the probability of overload for a 
given Nmai, or to determine the Nmax necessary 
for a given probability of overload. It should 
also be pointed out that, ultimately in TSS, as 
in a telephone exchange, several more user sta- 
tions may be allowed than can actually be in 
use at one time. The extent to which Nmax can 
be exceeded must again be determined by a 
distribution obtained in the same manner as 
for p. 

Sequencing and Priorities 

The sequence in which object programs are 
allocated time is determined by a priority 
scheme that favors the smaller programs that 
do not use low-speed I/O time. The amount of 
time allocated is given by the total time avail- 
able (tr), divided by the current number (n) 
of object programs requesting central process- 
ing time. When n= nma^i, the time allocated is 
given by the minimum quantum discussed in 
the previous paragraphs. 

The priority scheme was adopted to prevent 
low-speed I/O that was initiated by object pro- 
grams from degrading the response of those 
users not using low-speed I/O. Users whose 
programs require low-speed I/O must expect 
poorer response, not only because of the low- 
speed operations, but also because of possible 
conflicts in object-program I/O requests. Each 
object program in the system receives a prior- 
ity according to the criteria in Table II. 


Table II. Priority Criteria 


PRIORITY 

PROGRAM 

CHARACTERISTICS 

1 

Program is less than 16K and does 
not use low-speed I/O. 

2 

Program is less than 32K and uses 
low-speed I/O or, program is be- 
tween 16K and 32K and no low- 
speed I/O. 

3 

Program in excess of 32K. 


During any given interval of time. Priority 
1 programs will receive service first; Priority 
2, second ; and Priority 3, last. To prevent deg- 
radation of response by low-speed I/O, main 
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Figrure 4. Main Memory Allocation. 


memory is partitioned and allocated as shown 
in Figure 4. Because of the relatively small 
number of current TSS users, this storage al- 
location procedure has not yet been imposed on 
object programs. In the future when the num- 
ber of Priority 2 and 3 users begins to cause a 
significant degradation in Priority 1 response, 
this scheme will be fully implemented as de- 
scribed. 

Figure 4 shows that Priority 1 and 2 pro- 
grams can be multiplexed, but Priority 3 pro- 
grams preempt practically the entire machine. 
The priority scheme cannot solve the problem 
arising when a Priority 3 program undertakes 
a lengthy, low-speed I/O transfer. The majority 
of programs using low-speed I/O, however, 
concern tape transfers, which involve no 
searching, that take from 50 to 75 msec. 

When a program completes operation prior 
to the expiration of its time allocation for any 
of the reasons given in the second paragraph 
of this section, the remaining time will be re- 
distributed among the remaining users re- 
questing service. As a result, the large Priority 
2 and 3 users will generally receive more time 
than the Priority 1 users, thus increasing the 
potential utilization of central-processor time. 

Space Allocation 

Although the timing and speed limitations on 
TSS capability have been of concern, storage 
limitations are presently far more severe. Stor- 
age limitations can be largely removed, how- 
ever, by acquisition of additional drum space 
up to the maximum of about 600K. Figure 5 
gives a rough idea of how much drum storage 
must be provided for object programs, to 
achieve a balance between the speed and capac- 
ity of the system. The curves are obtained by 
letting nmtix pNmnx in Fq. (1), t, 2.0 sec., 
Tj = 0.2, and p = 0.2. 
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Figure 5. N™., = 

p ^t. -f q 

In the initial TSS model (with only eight 
TTY users), auxiliary memory drum storage 
was partitioned and allocated in a fixed man- 
ner to provide an early working model of the 
system. This technique proved quite satisfac- 
tory at the time, but the number of input sta- 
tions has now increased to about 48, To ac- 
commodate the additional users, a more effi- 
cient use of drum storage was necessary. The 
present method meets this requirement by al- 
locating storage in a contiguous, “head-to-tail” 
fashion. The adapatability of this storage-al- 
location method requires searching an inven- 
tory of available drum space each time a new 
program enters the system, and periodically 
redistributing drum space to maximize the 
available amount of contiguous drum space. A 
possible disadvantage of this method is the 
additional overhead produced, especially when 
programs must be reshuffled to allocate a 
sufficient amount of contiguous drum space 
for a new program. Here again, the perform- 
ance of this storage allocation technique must 
be evaluated by statistical recording, since the 
performance depends strongly on the distribu- 
tion of program sizes, and on the rates at 
which programs enter and leave the system. 
However, at present (and in the foreseeable 
future) the above rates are so low that the ad- 
ditional overhead produced is negligible. 

Future Improvements in TSS Scheduling 

There are many ways, including both hard- 
ware and software additions to the system, in 
which the capacity and scheduling efficiency of 
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TSS can be enhanced. The more or less obvious 
hardware improvements include : 

• Additional core memory 

• Additional drum memory 

• Relocation mechanism 

• Disk storage 

The effects of a relocation mechanism and ad- 
ditional drum storage have been described in 
the previous sections. Additional main memory 
can be expected to allow for a larger Executive 
system, larger object programs, and greater 
scheduling efficiency. However, a substantial 
improvement in scheduling efficiency must be 
predicated on the existence of a relocation 
mechanism, when one makes the obvious as- 
sumption that the memory size is small com- 
pared to the total size of all active object pro- 
grams. 

The disk file, which is just now being in- 
stalled and checked out, will supplant tapes for 
all those applications in which disks are faster 
and use less machine time. It is expected that 
disks will be used to store a program library 
for TSS usage and to store large data bases for 
object program usage. It is conceivable that 
disks will also be used for program swapping. 
The first use of the disk file promises to elimi- 
nate a high percentage of manual operations 
associated with program loading from tapes, 
and thus to reduce greatly the corresponding 
delay experienced by users. The second use 
should save considerable time for the user 
whose application involves searching through 
large data bases. 

Although the estimates given in this paper 
are based realistically on current experience, it 
is not unlikely that user characteristics will 
evolve quite differently than predicted. Pro- 
gram sizes and/or operating times may grow 
to a point that invalidates the “worst-case” 
figures given in this section. It is possible, 
however, to counteract a certain amount of this 
degradation by certain improvements in sched- 
uling logic. One improvement would be ob- 
tained by taking advantage of the fact that a 
fairly large class of users exists for whom re- 
sponses substantially greater than one or two 
seconds are quite acceptable. In short, it is 
possible to assign response levels to each user 


and to service each user just frequently enough 
to ensure his level of response. Furthermore, 
the disk file can be used for swapping those pro- 
grams for which short responses are not neces- 
sary. Provided that disk access is in parallel 
with other high-speed I/O activities, the effec- 
tive swapping speed can retain the same order 
of magnitude as for drums. 

There are many programs that do not alter 
themselves during their execution. Thus, as 
another software improvement, these pro- 
grams could be treated by the system in two 
sections : an instruction section and an en- 
vironment (data) section. During a program’s 
execution it would never be necessary to write 
the instruction section back on drums; only 
the environment section and the machine con- 
ditions at interrupt would be written back on 
drums. These and other improvements to TSS 
are under present investigation. Of principal 
concern in the investigation of these system 
changes is the amount of overhead they pro- 
duce. In some cases the increase in overhead 
exceeds the expected “improvement” in operat- 
ing speed and efficiency. 

TIME-SHARING APPLICATIONS 

To illustrate the “general purpose” nature 
of the Time-Sharing System, we focus on two 
interesting programming systems currently 
operating on TSS as service systems for the 
user. The first, IPL-TS, is a complete list- 
processing system for the Information Process- 
ing Language V developed by Newell, Simon, 
and Shaw.12 The second, TINT, is an on-line 
Teletype /A/Terpreter for theJOVIALalgebraic 
language developed by SDC.^®’ ” When the 
Time-Sharing System is equipped with these 
two programming-language systems, the user 
is immediately provided with a familiar pro- 
gramming system to ease his transition to 
programming for time-sharing, and allowed to 
use, with little or no modification, any code he 
may have previously written in IPL-V or 
JOVIAL for other machine systems. 

IPL-TS Description 

IPL-TS executes interpretively IPL-V code 
written in accordance with the latest published 
IPL-V conventions.!® Some exceptions are 
noted, particularly in the IPL-TS I/O conven- 
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tions dictated by machine limitations and time- 
sharing procedures. More significant, how- 
ever, are the extensions provided by IPL-TS in 
the areas of mode of code execution, and im- 
proved on-line communication. 

IPL-TS can operate in one of two modes at 
the programmer’s option : the “production” 
mode or the “debugging” mode. The production 
mode is designed for maximum code execution, 
and is used essentially for checked out code. 

Though code is still executed interpretively, 
^ suppression of all debugging functions in the 
production mode has produced a four-fold in- 
crease in execution rate over debugging-mode 
operation. Execution rates of over 400,000 
cycles per minute, which compare favorably 
with other non-time-sharing IPL systems, are 
common. To the IPL-TS user, production-mode 
operation is analogous, as we shall see later, to 
the TINT user compiling his checked-out code 
with the Time-Sharing JOVIAL Compiler 
(JTS). Debugging-mode operation, on the 
other hand, is designed for maximum user 
efficiency and greater on-line programmer con- 
trol over the execution of his program. The 
debugging mode allows all the standard IPL 
options; it also permits a number of on-line 
functions not common to IPL systems. These 
include: 

1. Optional breakpoint action at any moni- 
tor point, whereby the currently execut- 
ing program is suspended until comple- 
tion of the execution of any on-line, pro- 
grammer-specified routine ; 

2. On-line, symbolic program composition 
and/or debugging; 

3. Optional automatic or on-line program- 
mer-controlled execution of a full “back 
trace” routine that prints up to the last 
100 interpretation cycles. This routine 
is executed by IPL-TS automatically at 
each system-detected error occurrence as 
a debugging diagnostic ; and 

4. A flexible, “thin skinned” system error 
trap mechanism permitting programmer 
specification of trapping actions for all 
system-detected errors. 

TINT Description 

TINT is a two-pass interpretive program 


system for time-sharing use, and operates upon 
a subset of the JOVIAL problem-oriented 
language. 

TINT includes a generator, a set of operator 
subroutines, and the interpreter. The genera- 
tor was acquired from a current IBM 7090 
JOVIAL compiler and was modified to handle 
the particular JOVIAL dialect used by TINT. 
The operator subroutines and the interpreter 
are original code developed specifically for 
TINT. 

The generator (first pass) scans the input 
JOVIAL statements and translates them into 
an intermediate Polish prefix language. Cram- 
mer checking is performed during the transla- 
tion. The language subset allowed may include 
the arithmetic, relational, and Boolean opera- 
tors; procedure calls; data table, array, and 
item (integer, floating point, and Hollerith) 
declarations ; and the GOTO, IF, STOP, READ, 
and PRINT statements. The READ and 
PRINT statements were added to the language 
specifically for time-sharing operation. 

Operator subroutines comprise the primitive 
functions used by the interpreter to perform 
the actions specified in the intermediate 
language. The interpreter (second pass) scans 
the intermediate language for the current op- 
erator prefix and its arguments, and executes 
the corresponding operator subroutine that 
computes on these arguments. 

The TINT user is permitted a number of 
options in composing and executing his code*. 
He may reference code stored in a binary 
library tape of his own composition ; he may file 
away any current code on tape for subsequent 
use, or for compilation with JTS after the 
code has been exercised and debugged ; and, he 
may optionally execute code from a prestored 
tape or from the Teletype. 

On-Line Program Composition 

Both IPL-TS and TINT allow the user to 
write symbolic programs on-line and to execute 
them immediately, by themselves or in conjunc- 
tion with previously coded routines. With IPL- 
TS, the programmer uses the special system 
routine. Linear IPL (LIPL),* which accepts 


* LIPL was designed and coded by R. Dupchak while 
consnltant to the RAND Corporation, Santa Monica. 
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Figure 6. Typescript of Ackermann's Function. 

IPL code on-line in a symbolic, linear, paren- 
thesis format convenient for keyboard input. 
Figure 6 presents an example of LIPL being 
used to compose and execute Ackermann’s func- 
tion® on-line. TINT, which was developed 
specifically for on-line program composition, 
accepts JOVIAL statements on-line in the same 
linear format used for compiler input. 

The ability to program on-line frees the pro- 
grammer from having to concern himself with 
all the formalities of punched card accounting. 
With experience and facility, he programs on- 
line directly from his thoughts or, for more 
difficult problems, directly from a flow diagram, 
circumventing such time-consuming tasks as 
program-coding-sheet preparation, key punch- 
ing, card sorting, editing, and prestoring. The 
time saved by the programmer can be applied 
to other coding tasks or to quality review of his 
current code. 

No programmer, of course, could compose a 
large program at one sitting with either of 
these systems, but this is a human, not a sys- 
tem, limitation ; LIPL has no upper bound, and 
TINT’S 600-statement limit effectively exceeds 
a human’s short-term comprehension. Opti- 
mally, these systems should be used for pro- 
grams that can be written and debugged in one 
or two sittings (usually under 100 IPL instiuc- 
tions or 50 JOVIAL statements). 

There are three immediate consequences of 
this practical size limitation. First, many non- 
trivial, one-shot programs, such as for statist!- 


LOGIN 0173 JDX.25 
5.0K LOG ON 14 
LOAD TINT 1796 
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GO 
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STAKT "BEGINS NEW PROGRAM” 

ITEM NFS “NUMBER OF CASES” 

ITEM SUMX F S “SUM OF VALUES" 

ITEM XBAR F S “AKITH. MEAN” 

ITEM SDEV F S "STD. DEVIATION" 

READ N« SUMX S 
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N » ? 12.0 

SUMX * ? 147H.0 

MEAN « 123.2 
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ILT EXECUTION COMPLT 

!<#UIT 
SMSG IN 

Figure 7. Example of the Use of TINT as a 
“Desk Calculator.” 

cal computations, can be coded, debugged, and 
executed at one sitting. Often a programmer 
himself will refrain from writing such pro- 
grams, knowing the time and effort involved. 
Figure 7 shows the Teletype communication re- 
sulting from an exercise using TINT as a “desk 
calculator'' for computing the standard devia- 
tion of a set of research data. Second, large 
programs take on a modular structure ; that is, 
large programs become a concatenation of 
numerous smaller programs and subroutines. 
Third, programmers begin to amass personal 
libraries of short utility subroutines, which 
they use to build larger programs. Clearly, 
consequences two and three would not exist, 
except in trivial cases, if it were not possible 
to work one day with code developed cn prior 
days. Both IPL-TS and TINT provide this ca- 
pability. 

TINT may accept symbolic input from 
magnetic tape, and can integrate this input with 
on-line Teletype input when so directed by the 
user. Thus the results of one day's coding can 
be filed on tape for later use. An alternative, 
if the symbolic JOVIAL statements have been 
executed and debugged, is to compile the code 
and save the binary output on a binary library 
tape, thus, again, integrating previous work 
with current code ; however, the binary library 
approach has greatest value when used for 
utility routines. 
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Figure 8. Accessing the Computer with Model 33 
Teletypes and Displays. 


IPL-V is essentially a language of sub- 
routines (composed from an inventory of some 
200 system subroutines called J routines or 
primitives). Programs written in IPL-V are 
usually modular hierarchies of subroutines. 
Therefore, on-line composition of IPL-V pro- 
grams is a natural extension of the language, 
and many alternatives for continuity of pro- 
gramming across many days of operation 
already exist within the language. For ex- 
ample, the programmer may “fire” a J166 (Save 
For Restart) at any time and continue from 
that point at a later date, or he may load a pro- 
gram from symbolic tape using the loader or 
J165 (Load Routines and Data) and continue 
using LIPL on-line. 

Therefore, the attributes of IPL-TS and 
TINT, when combined with a programmer’s 
imagination and skill during on-line program 
composition, reduce significantly the tedious, 
uncreative tasks of code preparation and in- 
crease productivity. This point is particularly 
apparent to all programmers who have been 
required to debug code that they wrote several 
days earlier, and that has grown “stale” while 
it was being keypunched, compiled, and ex- 
ecuted. Instead of expending additional time 
and energy becoming reacquainted with his 
code before he can correct his errors, the pro- 
grammer can, by composing the code on-line 
and executing it immediately, debug while the 
code is still fresh in his mind. 

On-Line Program Debugging 

The particular ability of IPL-TS and TINT 


to detect, locate, and correct program errors on- 
line is perhaps their greatest asset, since it 
leads to substantial decrease in program turn- 
around time. In effect, IPL-TS and TINT in- 
crease the programmer’s debugging efficiency 
by allowing him to check out more code per day 
than would be possible with non-time-sharing 
operation. 

Error Detection is the first step in debugging 
any program. Errors may be classed as either 
grammatical errors in language or format, or 
logical errors in code execution. The genera- 
tor screens out most grammatical errors for 
TINT, and either the loader or LIPL performs 
the same task for IPL-TS. Logical-error detec- 
tion, however, is a more difficult task, even with 
IPL-TS and TINT. The advantage of these 
systems for error detection is their responsive- 
ness to the programmer. He may choose to 
develop on-line, special-purpose debugging tools 
to suit his individual preference, or he may use 
those debugging tools provided by the system. 
For example, IPL-TS currently provides an 
error trap for some twenty illegal IPL opera- 
tions resulting from faulty program logic ; 
when such errors occur, IPL-TS attempts to 
provide the programmer with as much informa- 
tion as possible to help him correct his error. 
First, an error message is sent to the program- 
mer to inform him of the error’s occurrence 
and of its nature. Second, a special system 
routine. Trace Dump (discussed below), pro- 
vides him with a “back trace” of the code lead- 
ing up to the error to help him locate the cause 
of the error. Finally, the system pauses at a 
breakpoint, to allow him time to correct the 
error. However, all three steps may be altered, 
since the IPL-TS error trap mechanism is de- 
signed with a “thin skin” to allow the program- 
mer to substitute his own trapping action in 
lieu of that provided by the system. 

With TINT, logical-error detection is left 
more to the imagination of the programmer. 
TINT allows the programmer to insert a 
PRINT statement, with numerous item names 
as arguments, at any point in his program. 
When it encounters this statement during pro- 
gram execution, TINT responds by printing on 
the user’s Teletype the current values of all 
specified items. In this fashion, the program- 
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mer may take item snapshots at critical points 
in his program. The power of the PRINT 
statement for logical-error detection is ampli- 
fied when combined with the TINT READ 
statement. The READ statement is the con- 
verse of the PRINT statement. When TINT 
encounters this statement during program ex- 
ecution, the programmer must insert the cur- 
rent values of prespecified items. By judicious 
use of the READ and PRINT statements, the 
programmer can repeatedly exercise a program 
with different initial conditions and review his 
results with input/output transfer-function 
analysis. 

Thus, on-line user-program communication 
increases a programmer’s debugging efliciency 
by increasing his ability to detect program 
errors. It is typical for a programmer, check- 
ing out new code with IPL-TS or TINT, to 
detect and correct half a dozen program errors 
in the first hour of operation; such error cor- 
rection might easily have required a week with 
conventional programming systems. 

Error location, the pinpointing of the errone- 
ous code, is often considered no different from 
error detection. This may be true for gram- 
matical errors, but is far from true for logical 
errors. The knowledge that an error exists does 
not, in and of itself, narrow the search for the 
error’s location. The user of IPL-TS, there- 
fore, is provided with a description of the 
system-detected error and the aforementioned 
back trace of the code leading up to the error. 
Back tracing by the system is performed in the 
debugging mode by the special system routine 
Trace Dump, which prints a full trace of up to 
the last 100 interpretation cycles, in reverse 
order (last cycle first). The number of previ- 
ous cycles printed is controllable on-line. Ex- 
perience shows that the location of an error can 
usually be found within the first five cycles 
printed, and that it is rarely necessary to go 
deeper than ten cycles back. For logical errors 
not detected by the system, the programmer has 
available all the standard IPL-V Monitor Point 
functions; in addition, IPL-TS extends these 
functions to include breakpoint operation as a 
programmer-initiated option. The option may 
be invoked at load time or during program ex- 
ecution. In addition, the IPL primitive J7 
(Halt) has been implemented as an alternative 


breakpoint mechanism. When a breakpoint is 
encountered by IPL-TS, the programmer is 
notified and requested to enter the name of any 
regionally defined routine, which is then ex- 
ecuted immediately. Upon completion of the 
routine, the programmer is again queried. He 
may continue to fire routines at the breakpoint, 
or he may exit back to the prior program, the 
context of which has remained undisturbed. 

Breakpoints are not a panacea for locating 
erroneous code; however, they do provide ad- 
ditional control flexibility at critical points in a 
program. In fact, the user of TINT must rely 
almost exclusively on breakpoint logic for 
locating erroneous code : the aforementioned 
READ and PRINT statements are in effect 
breakpoint statements. For elusive errors 
these statements may be used to bracket groups 
of JOVIAL statements, and in extreme cases, 
individual JOVIAL statements. TINT also 
provides a STOP statement, which is also a 
breakpoint statement. When the interpreter 
encounters the STOP statement, the program 
is suspended until directed by the user to con- 
tinue. The user may also reexecute his program 
from a STOP breakpoint, or he may enter new 
code or edit prior code before continuing. 
tint’s stop statement is analogous to the 
IPL-TS J7 (Halt) primitive. 

Error correction in symbolic code with either 
IPL-TS or TINT is essentially on-line program 
composition. LIPL allows the IPL program- 
mer to erase, extend, or modify selectively any 
user routine existing in the system. TINT, 
similarly, allows the programmer to edit any 
JOVIAL code written, on a statement-by-state- 
ment basis. 

Here, again, the programmer’s control over 
his program is effectively increased. He can 
correct code in several minutes instead of the 
several days typical with most computer in- 
stallations. 

SUMMARY AND CONCLUSION 

There are some obvious advantages to this 
kind of system that have been borne out in 
practice. There is a large class of problems 
whose compute time is extremely small in 
relation to the total time the problem is on the 
computer. This is because a large percentage 
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of time is taken up by human thought and com- 
puter input/output. In fact, the use of a com- 
puter for this kind of application in a non- 
timesharing mode is so inefficient that it 
would not be worthwhile to run. There are 
many examples of this kind of problem. The 
one that most programmers are familiar with 
is console debugging, that is, the checkout of 
programs with the programmer at the com- 
puter — anathema to most computer managers, 
but desired by a large number of programmers. 
These kinds of applications have been run with 
a high degree of success in this Time-Sharing 
System, with each person involved actually 
feeling he has the whole computer to himself. 

At the other end of the spectrum are those 
programs that compute for essentially one 
hundred per cent of the time they are on the 
computer. If these programs compute for long 
periods, say a matter of minutes, they will 
completely usurp their allotted time and thus 
tend to make the on-line user wait for the maxi- 
mum response period possible. Time-sharing 
does not benefit this kind of user, except that 
this kind of program can be run “in the back- 
ground” while other on-line interaction pro- 
grams are idle. In the SDC installation, the 
percentage of these long-period compute pro- 
grams has been small, so that no serious system 
response time delays have been noticed from 
them. 

Questions frequently asked are, “Do people 
like the system?” “Does it produce better re- 
sults than other, more standard techniques?” 
Both the questions are difficult to answer in an 
absolute sense. However, some reasonable ob- 
servations can be made that apply to this sys- 
tem and probably to others of this kind. 

First, those on-line interaction programs that 
used to run in a non-time-sharing mode but 
were converted to time-sharing produce results 
that are as valid as before but with greater 
efficiency in computer operation, since a num- 
ber of different ones are run simultaneously. 

Next, the on-line debugging capability has 
proved very valuable. This system of debug- 
ging gives a feeling of closeness to the computer 
and control over the program, so that debug- 
ging time is reduced considerably while the 
efficiency of computer utilization stays high. 


Also, although the tools available so far have 
been relatively few and unsophisticated, one 
can see the advantages to be gained by giving 
everyone immediate access and response from 
a computer. “Directed” computer runs are the 
mode of operation. Every step taken is taken 
only as a result or verification of the previous 
step. If things do not go as planned, alternative 
paths can be followed immediately. Before 
time-sharing, one had two choices: “submit- 
ting” of a run, followed by an anxious waiting 
period climaxed by a sigh (or worse) and a re- 
submitting of the same run ; or one-man on-line 
interaction with the computer, which benefitted 
that person, but caused consternation on the 
part of others waiting for computer runs. 

This kind of system must be made foolproof. 
Due to the nature of this system, one must have 
a reasonably long time of uninterrupted opera- 
tion to get satisfactory results. This implies 
several things : 

1. The system Executive must be reliable 
and able to account for any condition that 
may arise, including object program and 
machine errors. 

2. The machine must be reliable. Although 
the system must provide the ability to 
analyze each computer error and isolate 
and stop only the particular object pro- 
gram or programs affected, frequent or 
solid computer errors can cause the en- 
tire system and all object programs to 
•terminate. 

3. Certain hardware features are essential. 
These include: Memory protection — the 
ability to prevent object programs from 
destroying each other or the Executive 
system ; and high-speed large-storage 
random-access devices — ^the major bottle- 
neck in a system of this kind is the slow 
rate at which object programs can be 
moved in and out of memory. Also, the 
use of magnetic tapes for such functions 
as the permanent storage of programs 
and data files creates operational and 
timing problems that can be overcome 
with the use of large drums or disks ; also 
essential is clock interrupt capability — 
the system requires that no single pro- 
gram run for an excessively long time. 
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Therefore a clock that can be set to in- 
terrupt operation at various intervals is 
necessary for complete control and the 
assurance of adequate response time. 

When this Time-Sharing System first became 
operational, it had no memory protection, its 
Executive was unreliable, and its computer was 
beset by a much heavier load than it was used 
to and reacted accordingly. With these ob- 
stacles, the early users were subject to frus- 
trations unlike many found in the twentieth 
century. The system’s life expectancy was no 
more than ten minutes. The only remarkable 
thing about the early months was that anything 
useful was accomplished. Interestingly enough, 
however, some work was accomplished, pri- 
marily through patience on the part of the 
users. With the passage of time, many of the 
problems have been alleviated through both 
equipment and programming improvements, so 
that now the system runs with considerably 
more continuity and reliability. 

Since the system became operational, it has 
been used in a wide variety of applications. 
These applications have, for the most part, 
been checked out using the Time-Sharing Sys- 
tem and have been run productively during 
time-sharing. Some of the specific applications 
for which time-sharing has been used are : 

• Natural Language Processors — ^used for 
parsing English sentences, answering 
questions, and interpreting sentence- 
structured commands. 

• Group Interaction Studies — in which 
teams or players are matched against each 
other and the computer is used to measure 
individual and team performance. 

• General Display Programming — in which 
the programs are used as vehicles for gen- 
erating and modifying visual displays ac- 
cording to the users’ keyboard inputs. 

• A FORTRAN-to-JOVIAL Translator- 
symbolic JOVIAL program tapes are pro- 
duced for FORTRAN tape inputs. 

• Simulated Alternate Mobile Command 
Post — a realistic simulation of the A.M.C.P. 
has been produced, and the display require- 
ments for this organization are studied 
within this framework. 


Of course, a number of other routines, games, 
and services have been and are being developed 
under the system. 

One of the “disadvantages” in using a time- 
sharing system such as this is the fact that most 
computer runs require the presence of one or 
more people. Users of many large-scale com- 
puters are accustomed to remaining detached 
from the actual computer runs and are some- 
times reluctant to follow the runs closely. How- 
ever, the elapsed time for completing jobs 
using these “on-line” techniques is normally 
dramatically reduced compared to a more re- 
mote operation, and this reduced time has been 
noted in the use of the time-sharing system. 

It is interesting to watch a group of people 
using a computer simultaneously but solving 
different problems using different tools. At the 
computer console itself, one can usually see all 
the available tape drives busy, typewriters 
busy, drum indicators indicating the drums are 
busy, the punch punching, on occasion, and the 
card-reader going at anywhere from quarter 
to full speed. For those who judge the worth 
of a computer by the amount of equipment used 
per second, time-sharing is well worth its in- 
vestment. 

Since the system has been under development 
(it was begun in January 1963), the number 
of existing services has been expanding rapidly. 
One can envision the development of an in- 
creasing number of on-line programming aids 
and techniques of utilizing keyboards, displays, 
and groups of computers to make a time- 
sharing network a truly powerful device. 

It is certainly conceivable that, in the not too 
distant future, many people will have at their 
fingertips a device that, at a reasonable cost, 
enables them to enter an operating network 
such as this one. While in this network, they 
will have access to routines, techniques, and 
computing power unavailable to them by other 
means. The computing power will include not 
only the Executive computer but the other com- 
puters that are in the network as well. Thus, 
the possibility of large-scale time-sharing net- 
works seems to be one of the more promising 
developments in computer technology today. 
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INTRODUCTION 

Background 

Remote computing has been around as long 
as computers themselves. ^ More recently, in- 
terest has revived in providing remote users 
with convenient, economical access to a large 
central computer. Considerable attention has 
been addressed to its economics^ and practical- 
ity,^ Several batch-oriented systems have been 
implemented.'^’ ^ The techniques of time-shar- 
ing®’ a large^’ or smalD^ system have been 
described, as have the attendant advantages of 
man-machine interactioni^- is fQj, symbolic 
mathematics^'^ and program testing. Several 
input-output devices have been considered, in- 
cluding typewriters, 1® displays, and dial- voice 
equipment.i^ 

The management, 1®’ 2®* 21 systems analysis,22 
program testing, 2 ^ and documentation24 of spe- 
cialized real-time systems have also been em.- 
phasized, but much less attention has been 
given to the design of general-purpose on-line 
systems. 

This paper reviews some general system re- 
quirements and applications criteria leading to 
basic design objectives and constraints for 
remote-computing systems. An experimental 
system using a number of remote terminals 
time-sharing a standard computer is then de- 
scribed. 


System Requirements 

There are several requirements that must be 
considered when designing a practical remote- 
computing system. 

1. The remote user does not have access to 
experts for programming assistance and 
advice. If he uses a problem-orierited 
language to express his problem, he re- 
quires that the request for and display 
of debugging data be consistent with this 
programming language. 

2. Because his jobs are processed completely 
without human intervention, the remote 
user obviously cannot communicate his 
desires to a machine operator. This leads 
to several considerations : 

a. The command statements used to regu- 
late the system should have a form 
and content consistent with the pro- 
gramming languages employed. 

b. The remote user requires a powerful 
command structure; he should have 
the ability to state such things as run 
time, job status, error procedures, and 
disposition of output. 

c. The conversational remote user re- 
quires access to many of the facilities 
available to the machine operator in 
the form of console buttons, lights, 
and switches. He should receive steady 
reassurance that “all is well’’ by some 
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form of periodic ‘‘blinking^' at his 
terminal. He also needs the ability to 
stop his “machine'' at any time and 
without loss of data — so that he can 
perform such simple functions as 
changing some printer paper, placing 
more input cards in a reader, or dis- 
continuing a job. 

3. The remote user is very conscious of 
input/output volumes. He must have the 
capability to modify decks without com- 
plete retransmission, and he should have 
the option to selectively inspect and list 
output data, as opposed to massively 
transmitting entire output files. Also in 
this spirit, he desires to keep His various 
decks in random storage — quickly and 
conveniently available for modrfication, 
processing, or review. 

4. Finally, the remote user should be given 
the impression that he is the only user 
and that he is in complete control of the 
situation. More specifically, in a time- 
sharing environment, he should be totally 
secure from unwanted, possibly destruc- 
tive, interaction by others. Ideally, the 
computing and response rate of his ter- 
minal should not radically fluctuate ac- 
cording to the demands of the rest of the 
user population ; in other words, his 
“share" of the central system should per- 
form at a relatively uniform processing 
rate. 

Application Criteria 

The following criteria were among those used 
in deciding whether commercial or scientific 
applications were more amenable to remote 
operation : 

1. Time devoted to program development vs. 
production runs ; 

2. Importance of job turnaround vs. com- 
puter throughput ; 

3. Available programming languages ; 

4. Conversion problems ; 

5. Reliability objectives ; 

6. Input/output volumes ; 

7. Random-storage requirements. 


It was concluded that there was more imme- 
diate technological significance and lower hard- 
ware-software risk in placing initial emphasis 
primarily on the scientific applications area. 

Design Objectives 

The following functional design objectives 
were then established : 

1. Output data should be as user-oriented 
as the source language ; 

2. Diagnostic messages and logical analysis 
should be definitive enough to allow pro- 
gram debugging to take place at the same 
level as program construction ; 

3. The user should have immediate and 
sustained access to the computer ; 

4. The user should have the ability to exe- 
cute, alter, and change values, variables, 
and formulas, and to request information 
selectively ; 

5. The system should be at least as easy to 
learn as the FORTRAN language ; 

6. The print volume should be minimized 
without loss of quality, on demand of the 
user; 

7. The system should provide the shortest 
possible solution time, ideally no longer 
than the time required to construct and 
run the solution itself. 

Design Constraints 

Finally, the following restrictions were im- 
posed : 

1. Use only an existing standard equipment 
configuration ; 

2. Use, and stay consistent with, an existing 
language. 

The first constraint serves to keep attention 
primarily on fundamental programming prob- 
lems and discourages the favorite desire of 
many engineers to solve systems problems by 
the design of a new feature or the development 
of new devices. 

The second constraint serves to keep atten- 
tion primarily on the processor design and dis- 
courages the favorite desire of most program- 
mers to solve systems problems by the design 
of new languages or the development of new 
compilers. 
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The Approach 

Our approach to accomplishing the objectives 
fuses the old technique of interpretive execu- 
tion with the relatively new one of time-sharing 
a CPU. Thus the cost of sustained access to a 
computer by an individual is spread over a wide 
base. The internal form suitable for interpre- 
tive execution retains all the information con- 
tained in the user's original statemenj: of the 
problem, thereby making symbolic debugging 
possible. Together, these two techniques make 
the conversational mode of operation on cur- 
rent equipment a practical reality. 

Nevertheless, the service this system per- 
forms is not a matter of cleverly getting some- 
thing for nothing, but a justifiable trade-off. 
Execution time is greater, but elapsed solution- 
time is significantly smaller. The cost of the 
total equipment configuration is comparable to 
that of typical large computer systems, but the 
cost per terminal is in the small computer range. 
In short, this system converts some of the raw 
power of the computer into condensed solution- 
time and greater creative power for the user. 

OPERATIONAL DESCRIPTION 
Eqidpm ent Configuration 

The hardware (see Figure 1.1) consists of: 

1. An IBM 7040^^’ with 32K memory ; 

2. An IBM 13012'^ disk-file storage, for 
permanent retention of user programs; 

3. An IBM 7320-^ drum storage, for the con- 
tinual swapping of user programs ; 


TERMINALS EXCHANGE CPU STORAGE 

DEVICES 



Figure 1.1. Remote-Computing System: Equipment 
Configuration 


4. A few magnetic tape units, for logging 
system transactions and to maintain nor- 
mal computer capability ; 

5. An IBM 7740 communications control 
system, 29 for the real-time acceptance and 
transmission of messages ; 

6. A number of IBM 1050^® terminals with 
keyboard-printer and, optionally, a card 
reader and card punch. 

The User's Terminal Console 

In use, the terminal console (see Figure 1.2) 
appears to be a self-sufficient FORTRAN ma- 
chine. The user is completely unaware of any 
assembly system or the internal organization 
of the central computer. The language is con- 
sistent with FORTRAN, augmented by a set 
of operating, testing, and debugging state- 
ments. The mode of communication is called 
“conversational," as opposed to “batch," be- 
cause the basic unit of input is the individual 
statement rather than an entire program, and 
every communication by one of the participants 
is acknowledged by the other. 

The form in the terminal printer (see Figure 
1.3) consists of 126 columns (10 characters/ 
inch). The first 12 (unnum.bered) columms are 
reserved for control fields, and the remaining 
114 (numbered) columns are identical to a 
FORTRAN coding form, except for length. 

The first five columns of the control-field por- 
tion are used to display a line number (101.0 



Figure 1.2. IBM 1050 Data Communications System 
(with card reader). 
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LINE 

STATUS 

|•tATCMCNT 

NUMKII 

Ij2 5 « 

STATEMENT 

7 10 15 20 25 30 



1 




1 




1 




j 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I i t 1 J 1 



1 

_ L t - L I .L i 1 1 1 1 1 1 1 1 L 1 1 1 1 1 1 1 1 1 1 1 1 



1 

_ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 L 1 > 1 1 I 1 1 1 t » 1 » 



1 

1 1 1 1 

_ 1 1 L 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 i t 1 1 1 1 1 1 1 



1 ' ‘ - 
! . 1 . 

1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

till, 

• 1 1 1 1 1 

! . . . 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ 1 1 1 1 1 1 1 I 1 I 1 I 





_L- — — \ 


Figure 1.3. Remote-Computing-Terminal 
Programming Form. 


to 999.9), i.e., an identifier automatically gen- 
erated by the system to uniquely label each 
program statement. The first of the remaining 
seven control columns contains a code character 
— minus, plus, or equal — denoting the source 
of the line — input in Command or Program 
mode (see below), or output. The second 
through sixth columns of this field contain a 
status word which cues or informs the user: 
for example, the cue word READY invites the 
user to enter his next statement; and ERROR 
readily identifies a diagnostic message. (The 
seventh column is always blank.) 

In the first two (numbered) columns of the 
FORTRAN-like portion of the form, “C blank” 
is treated as an ordinary comment, while '‘CV” 
causes the statement to be ignored by the sys- 
tem and serves as a hard-copy comment not 
germane to the program itself. Any character 
other than '‘C” in column 1 is considered a 
monitor-control character and the statement is 
treated as a normal comment; ''CF” is treated 
as a comment in FORTRAN, but considered 
a normal statement in this system, and thereby 
serves to keep a source-program card deck 
compatible with other FORTRAN compilers. 

General Operating Statements 

The general operating statements (see Fig- 
ure 1.4) may be used at any time. COMMAND 
establishes the Command mode (see below). 
EXIT signs the terminal off. Each terminal is 
set to standard real and integer formats (E15.8, 
I 11) for output of all values not under explicit 
FORMAT control. EDIT ( — ) permits the 
user to change either or both. 


GENERAL 

OPERATING 


COMMAND 

EXIT 

EDIT 


PROGRAM 

DEFINITIONS 


LOAD 

PROGRAM 

SUBROUTINE 

FUNCTION 


PROGRAM 

REFERENCE 


NUMBER 

LIST 

DUMP 

INDEX 

CHECK 

AUDIT 

TRACE 

TRAP 


PROGRAM STATEMENTS 

SUBROUTINE 

IF 

FUNCTION 

DO 

EXTERNAL 

CONTINUE 

REAL 

PAUSE 

INTEGER 

STOP 

DIMENSION 

RETURN 

COMMON 

CALL 

EQUIVALENCE 

ARITHMETIC 

FORMAT 

READ 

END 

PRINT 

GOTO 

PUNCH 

GOTO ( ).I 

WRITE 


PROGRAM 

OPERATING 


CC DOMAIN 

ALTER 


Arithmetic 

ASSIGN 


READ 

RESET 


PRINT 

UNLOAD 


PUNCH 

START 


WRITE 


Figure 1.4. System Language. 


Command Mode 

When no program is active for a given termi- 
nal, that terminal is said to be in the Command 
mode ; and, conversely, the entering of a COM- 
MAND statement will destroy the active image 
of the user program. Since no program is ac- 
tive, statements cannot be retained, but must 
be processed immediately. Consequently, the 
user may employ only the general operating 
statements, the program-definition statements 
(see below), or a limited form of the arithmetic- 
assignment statement. This latter provision 
allows the terminal to be used as a fast, versa- 
tile symbolic desk calculator. In this mode, 
the user enters a statement of the form, X=:any 
expression consisting of constants and built-in 
functions, and the system immediately evalu- 
ates the expression and prints the result on the 
user's terminal. 

Program Mode 

1. Program-Definition Statements 

The program-definition statements (see 
Figure 1.4) initiate the Program mode. 
LOAD fetches an existing program from 
the user's library, while PROGRAM initi- 
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ates the creation of a new main program, 
and SUBROUTINE or FUNCTION intro- 
duce subprograms. 

2. Program Statements 

In the Program mode, certain statements, 
called program statements (see Figure 
1.4), are translated and retained. Conse- 
quently, unlike the arithmetic assignment 
statement in the Command mode, their 
execution is initiated by the user. All FOR- 
TRAN statements used by this system 
(see Figure 1.4) are program statements. 
It is with these that the user may construct 
a stored-program solution to his problem; 
all other statements are processed imme- 
diately and are not retained. As with FOR- 
TRAN, there can be only one main pro- 
gram — but there can be numerous sub- 
programs, with the restriction that no 
single subprogram exceed 4000 words of 
storage. In this way, although individual 
program size is restricted, total program 
size may be much larger. 

3. Program-Operating Statements 

The program-operating statements (see 
Figure 1.4) allow the user to execute, alter, 
select I/O components, reset certain initial 
conditions, and unload his programs to 
library storage, at any time. 

START with various operands allows the 
user to begin execution from the first or 
any other executable statement, or to exe- 
cute a segment from one line or statement 
number to another, or to resume execution 
after manual intervention, 

ALTER allows the deletion and insertion 
of statements. 

SELECT permits specifying the console’s 
I/O devices other than the keyboard- 
printer. 

RESET with various operands initializes 
the program for fresh testing runs. 
UNLOAD places the user program in li- 
brary storage, but does not remove it from 
the active status. 

SOURCE LANGUAGE DEBUGGING 

Debugging information is requested and dis- 
played in a form consistent with the source 
programming language.^^* 


Diagnostic Structure 

Errors committed by the user may be classi- 
fied in two broad categories: syntactic and 
semantic.-^^ 

1. Syntactic Err or s^^ 

All syntatic errors are considered the re- 
sponsibility of the system and are further 
categorized as follows : 

Composition, Typographical errors, viola- 
tions of specified forms, and misuse of 
variable names (e.g., incorrect punctua- 
tion, mixed-mode expressions, undeclared 
arrays, etc.) 

Consistency. Statements which are in 
themselves correct, but conflict with other 
statements (e.g., conflicting declaratives, 
illegal statement ending a DO range, fail- 
ure to follow each transfer statement with 
a numbered statement, etc.) 

Completeness. Programs which have not 
been completely specified by the user (e.g., 
transfers to nonexistent statement num- 
bers, improper DO nesting, illegal transfer 
into the range of a DO loop, etc.) . 

Errors of composition and consistency are 
detected immediately upon entry of the 
offending statement. The user may imme- 
diately substitute a correct statement. 

Errors of completeness are discovered 
when the user signifies that his program 
is complete (by entering the END state- 
ment) . 

Some errors (e.g., invalid subscript value, 
reference to an undefined variable, arith- 
metic spills, etc.) may, of course be de- 
tected only during execution. In this case, 
after a display of the error condition and 
its location, execution is interrupted and 
the terminal reverts to READY status. 
The user then has the option of either im- 
mediately correcting his error or proceed- 
ing with the rest of his program. 

For all syntactic errors, the diagnostic 
message is concise — in that the variable in 
error is named, or the column where the 
error occurred is specified — and often 
tutorial in suggesting the procedure for 
obtaining correct results. 
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2. Semantic Errors 

Semantic errors are concerned with the 
meaning or intent of the programmer and 
are necessarily his responsibility. However, 
an extensive set of debugging aids are pro- 
vided for manipulating and referencing a 
program in ferreting out errors in logic 
and analysis. 

Value Manipulation 

Not too surprisingly, some types of program 
statements are also useful for manipulating the 
values of a user program (see Figure 1.4). 
Consequently, special characters — called proc- 
ess codes — may be inserted into the first two 
columns to allow the use of these statements 
as commands, thus causing values to be read 
into or out of selected variables. (This is 
analogous to the panel-entry/display functions 
performed at a conventional computer console.) 

For example, “CC’’ in columns 1 and 2 of the 
FORTRAN-like form (Figure 1.3) has the fol- 
lowing effect on its accompanying statement: 
the statement is immediately executed with all 
the effects of normal execution, but no new 
variable names are created; the statement is 
then discarded and does not become a part of 
the program. Thus, values may be inserted 
into factors or parameters at any time, thereby 
creating completely new testing situations with- 
out having to build their presence into the logic 
of the program or attempting to anticipate the 
debugging operations required. 

Program Reference Statements 

Program reference statements (see Figure 
1.4) allow the user to display various vital con- 
ditions of his program. These statements are 
not retained. They are acted upon immedi- 
ately, then discarded. Through their use, a 
complete, dynamic record of both control flow 
and data usage may be obtained. 

DUMP (Figure 1.6b, line 143) produces an 
alphabetically ordered listing of program iden- 
tifiers with their current values. Array sub- 
scripts are stepped automatically, contiguous 
zero- valued array elements are omitted, and 
empty elements (i.e., those never receiving a 
value) are flagged. 


TRACE with various operands (Figure 1.6c, 
line 144 et seq.) allows the user during later 
execution to print out every change of value 
for all variables, for all variables within a 
specified region, or for specific variables when- 
ever they are changed. 

TRAP (Figure 1.6c, line 145 et seq.) is a 
logical trace of all control transfers, or of all 
transfers within a specified region. 

NUMBER (Figure 1.6e) with various oper- 
ands resequences and lists the program. 

LIST generates a listing of the entire pro- 
gram or any specified portion of it. 

INDEX (Figure 1.6f, line 231) produces a 
complete cross-reference table, ordered on 
statement numbers and variable names, show- 
ing the line number of every statement in which 
each statement number or variable name ap- 
pears and whether it was declared, defined, or 
referred there; or, INDEX produces a single 
such line for a specified variable or statement 
number. Any line of the table which is, or 
may be, in error is marked with an asterisk. 
These features are very useful when making 
program modifications. 


lOl -READY 

101 -READY 

102 +REA0Y 

103 +READY 

104 +READY 

105 +READY 

106 +READY 

106 +ERR0R 

107 +READY 

108 +READY 

109 4>REA0Y 
no +READY 

111 +READY 

112 ■I'READY 

113 +READY 

114 +READY 

115 +READY 

116 +READY 

117 +READY 

118 +READY 

119 +READY 

no =1101 

112 =0102 

114 =CRR0R 

120 +READY 

115 =0103 

116 = CYCLE 

121 +READY 

122 +READY 

123 +READY 

124 +READY 

125 mADY 

126 +READY 

127 +READY 

128 +READY 

129 +READy 

130 +READY 
130 4>ERR0R 
130 +ERR0R 
13] +READY 

1 32 +READY 

133 +READY 

134 +READY 

135 +READy 

136 +READY 

137 +READY 

138 •1-REaDY 

139 +READY 


2 = 
04200. 
CV 
CV 


101 

102 


103 

CV 

CV 


THIS IS A sample PR09RAI1 
PR0QRAM SAMPLE 

DIMENSION ZPL0T <5I), TABLE<500) 

DELX = .2 
X = 0 
Y = 1. 

1 

. STATEMENT H0T IN LAHQUAfiE. 

typographical errors may be CORRECTED IMMEDIATELY 
BY SUBSTITUTING A CORRECT STATEMENT VIA KEYBOAtoJ 

READ 101, char, ZPLOT 
F0RMAT<52A1) 

PRINT 102 

P0RMAT(5X1HX5X1HY) 

GOTO 2 

PRINT 103, X , y 
F0RMAT<2XF5.2,r8.5) 

SEQUENCE OF STATEMENTS HAY BE 
StSTo*^° IMMEDIATE EXECUTION AFTER ENTRY, 


X Y 

transfer POINT N DOES NOT EXIST 
START 1 
0. 1.00000 

END OF program ENCOUNTERED DURING EXECUTION 
1=1+2 
X = X + DELX 
DELY = X*Y*DELX 
T = Y + DELY 

2 TABLE(I) = X 
TABLECI+1) = Y 
IFCX - l.)|,i,3 

3 DO 4 J = 1, I, 2 
X = Table (J) 

)-TA3LE(2) )/< TABLE! I+l ) - lABLECa) )*50. ) 
ARITHMETIC DECOMPOSITION ERROR! S) 

MIXED MODE 

ARE NOT ACCEPTED. 

S^STITUTION hay be made WITHOUT RE-ENTERING PROGRAM. 

it 5 J+n “ T ABLE! 2 ) >/ ! T ABLE! 1+ 1) - T ABLE! 2 ) > *5 0. ) 

ZPLOT! K) = char 
PRINT 104, X , ZPLOT 
. F0RMAT(F5.2,51 Al) 

4 ZPLOT! K) = ZPLOT! K+1) 

STOP 77 
END 


CV 


104 


Figure 1.6a. Sample Program: Creation and Testing. 
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140 

+READY 

start 0 

110 

= :ioi 

* 

112 

=0102 

X Y 

115 

=0103 

0. 1. 00000 

115 

=0103 

0.20 1.04000 

115 

=0103 

0.40 1.12320 

115 

= 0103 

0.60 1.25798 

115 

= 0103 

0.80 1.45926 

115 

=0103 

1.00 1.75111 

135 

=0104 

0. ♦ 

135 

=0104 

0.20 * 

135 

= 0104 

0.40 ♦ 

135 

=0104 

0.60 « 

135 

=0104 

0.80 ♦ 

135 

=0104 

1.00 

155 

=0104 

1.20 

137 

=ERR0R 

value 0F SUBSCRIPT IS ZER0, 

141 

+REA0Y 

DUMP 


z 

CHAR=-0. 14191581 E-08 



DELX= 0.20000000E-00 



DELY= 0. 42026726 E- 00 

142 

+READY 

EDIT(F8,5) 

143 

+READY 

DUMP 


ERR0R 

illegal character in text 
DUHP 



CHAR= -0.00000 



DELX= 0.20000 


= 

DELY= 0.42027 


NEGATIVE, 0R EXCEEDS D1HENSI0N 


Is 

J= 

Ks 

X= 1.20000 
Y= 2.17138 
TABLE(1)= 0. 
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+READy 

TRACE R 

145 

+READY 

TRAP 101. /138. 

146 

+READY 

start 0 

no 

= 1101 

« 

112 

=0102 

X Y 

114 

=trap 

TRANSFER T0 2 «125 ) 

127 

=trap 

transfer T0 1 Cl 15 ) 

115 

=0103 

0. 1. 00000 

127 

STRAP 

TRANSFER T0 1 Cl 15 > 

115 

=0103 

0.20 1.04000 

127 

=TRAP 

TRANSFER T0 I Cl 15 > 

115 

=0103 

0.40 1.12320 

127 

strap 

transfer T0 1 C115 ) 

115 

=0103 

0.60 1.25798 

127 

strap 

TRANSFER T0 1 Ctl5 > 

115 

=0103 

0.80 1.45926 

127 

=TRAP 

TRANSFER T0 1 Cl 15 ) 

115 

=0103 

1.00 1.75111 

127 

=TRAP 

TRANSFER T0 3 C128 ) 

133 

=TRACE 

K= 1 

135 

=0104 

0. ♦ 

133 

=TRACE 

K= 2 

135 

=0104 

0.20 ♦ 

133 

sTRACE 

K= 6 

135 

=0104 

0.40 * 

133 

= trace 

K= 12 

135 

=0104 

0.60 « 

153 

= trace 

K= 20 

135 

=0104 

0.80 * 

133 

=trace 

K= 33 

135 

=0104 

1.00 

133 

= TRACE 

K= 51 

135 

=0104 

1.20 

137 

=ERR0R 

VALUE 0F SUBSCRIPT IS ZER0, 

Figure 1.6c. Sample Program 


(continued). 


TABLEC21= 1.00000 

147 +READY 


ALTER 137./137. 

TABLEC3>= 0.20000 

137 +ALTER 


ZPL0TCK> = BLANK 

TABLE<4)= 1.04000 

1371+ALTER 


ALTER no. 

TABLEC5)= 0.40000 

llOl+ALTER 


blank = ZPL0TC1) 

TABLEC6>= 1.12320 

1102+ALTER 


ALTER * 

TABLEC7)= 0.60000 

11O2+ERR0R 

00 

128.0 REFERENCES UNDEFINED LABEL 4 

TABLEC8)= 1.25798 

148 +READY 


alter 137./137. 

TABLEC9)= 0.80000 

137 +ALTER 

4 

ZPL0TCK) = BLANK 

TABLECIO): 1.45926 

1371+ alter 


ALTER* 

TABLECll): 1.00000 

149 +READY 


TRACE* K 

TA8LE(I2>= 1.75111 

150 +REA0Y 


trap* 101. /138. 

TABLE(13>= 1.20000 

151 +READY 


START 0 

TABLEC14): 2.17138 

110 =1101 

* 


TABLE<500)= 0. 

112 =0102 


X Y 

ZPL0TC1)=-6.O9524 

115 =0103 

0. 

X 1.00000 

ZPL0TC2)s -6.09524 

115 =0103 

0. 

20 1.04000 

ZPL0T<3)=-6. 09524 

115 =0103 

0. 

40 1.12320 

ZPLSTC4)==S. 09524 

115 =8103 

0. 

60 i. 25798 

ZPL0TC 5) =-6.09524 

115 =0103 

0. 

80 1.45926 

nay be INTERRUPTED. 

115 =0103 

1. 

00 1.75111 


135 =0104 

0. 

* 


135 =0104 

0.20 

* 


135 =0104 

0.40 

* 


135 =0104 

0.60 

* 


135 =0104 

^.80 

* 


135 =0104 

1.00 

* 


135 =0104 

1.20 



138 =S77 




152 +READY 

» 



Figure 1.6d. 

, Sample Program; Creation 




(continued). 


Figure 1.6b. Sample Program: Creation and Testing 
(continued) . 

CHECK (Figure 1.6f, line 232) is an abbre- 
viated INDEX in that only erroneous and 
suspicious items are displayed (i.e., only those 
INDEX lines marked with an asterisk). 

AUDIT generates cross-reference informa- 
tion based on the execution of the program, 
showing which sections were never executed 
and which variables were never set, or set but 
never used. This concise, post-mortem sum- 
mary of incomplete control flow and data usage 
is a powerful aid in ensuring the thoroughness 
of program debugging. 


201 = 
202 : 

203 = 

204 : 

205 = 

206 = 
207 = 
203 : 

209 r 

210 = 
211 = 
212 = 

213 = 

214 = 

215 = 

216 = 

217 = 

218 = 

219 = 

220 = 
22! : 
222 = 

223 = 

224 = 

225 = 

226 = 

227 = 

228 = 

229 : 

230 : 


CF 


NUMBER 201. 

PR0GRAM SAMPLE 

dimension ZPL0T(51),TABLE(5OO) 

DELX=.2 

X=0 


READ 101, CHAR, ZPL0T 
BLANK=ZPL0T(1) 

101 F0RHAT<52A1) 

PRINT 102 

102 F0RMATC5X1HX5X1HY) 

G0 T0 2 

1 PRINT 103, X,Y 

103 F0RMAT<2XF5.2,F8.5) 

1 = 1+2 

X=X+DELX 

DELY:X*Y*DELX 

Y=Y+D£LY 

2 TABLECDrX 
Ta8LE< I+1)=Y 
IF(X-|.)1,1,3 

3 D0 4 J= 1,1,2 
X=TABLECJ) 

K=1.+(TABLECJ+1>-TABLEC2))/<TAE 

ZPL0TCK)=CHAR 

PRINT 1O4,X,ZPL0T 

104 F0RHATCF5.2,5IA1) 

4 ZPL0TCK): BLANK 
ST0P 77 

END 


I+1)-TaBL£C2))*50, 


Figure 1.6e. Sample Program: Creation and Testing 
(continued) . 
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3. As in most compilers, the sequence of 
translated code for arithmetic expressions 
may differ from that produced by other 
compilers and slight discrepancies due to 
variations in truncations jnay occur. 

4. Some minor differences in the internal 
representation of program constants, 
caused by different conversion routines, 
may also create slight differences in nu- 
merical results. 


231 +READY INDEX 


r 

1 

+213. 

-221. 




z 

2 

+219. 

-212. 




z 

3 

+222. 

-221. 




• 

4 

+228. 

-222. 




z 

* 5 

+207. 





z 

101 

+209. 

-207. 




z 

102 

+211. 

-210. 




z 

103 

+214. 

-213. 




r 

104 

+227. 

-226. 




; 

BLANK 

+208. 

-228. 




z 

CHAR 

+207. 

-225. 




z 

OELX 

+203. 

-216. 

-217. 



z 

DELY 

+217. 

-218. 




z 

I 

+206. 

+215, 

-215. 

-219. 

-220. 

z 


-222. 

-224. 




z 

J 

+222. 

-223. 

-224. 



z 

X 

+224. 

-225. 

-228. 



z 

♦SAMPLE 

201. 





z 

table 

202. 

+219, 

+220. 

-223. 

-224. 

z 

X 

+204. 

-213. 

+216. 

-216. 

-217. 

z 


-219. 

-221. 

+223. 

-226. 


z 

Y 

+205. 

-213. 

-217. 

+218. 

-218. 

z 


-220. 





z 

ZPL0T 

202. 

+207. 

-208. 

+225. 

-226. 

z 


+220. 





252 +READY 

CHECK 






= 

* 5 

+207. 





- 

♦SAMPLE 

201. 





235 +READY 








Figure 1.6f. Sample Program: Creation and Testing 
(continued). 

Built-in Subroutines 

Since only program statements are retained, 
many excellent testing and debugging com- 
mands would not be available under program 
control, but would require the presence of the 
user at the moment of execution. To overcome 
this limitation, most of these statements have 
been designated as “built-in subroutines,’' a 
concept completely analogous to FORTRAN 
built-in functions. These statements, without 
change in their form, may be made the operand 
of a subroutine CALL statement. In this way, 
all the console testing and debugging features 
which may be of value are also available under 
program control. 

COMPATIBILITY 

Studious regard has been paid to maintaining 
consistency with other FORTRAN compilers. 
Programs written in the system language are 
acceptable without change to conventional 
FORTRAN IV processors. FORTRAN IV pro- 
grams are acceptable to the experimental sys- 
tem with the following limitations : 

1. The program must be written with state- 
ments from the system subset. 

2. A restriction of all one-pass translators 
is that the source-deck ordering must have 
the declarative statements precede the 
imperative statements. Of course COM- 
MENT and FORMAT statements may ap- 
pear anywhere. 


5. Individual source programs are limited 
to approximately 400 statements. This 
limit may often be circumvented by seg- 
menting oversized programs into smaller 
subprograms. 

6. Other Factors : 

a. No arithmetic function statements ; 

b. No logical, complex, or double-preci- 
sion variables ; 

c. Number of elements (i.e., constants, 
variables, arrays, and functions) must 
be less than 190 ; 

d. Only one continuation card ; 

e. No magnetic tape I/O ; 

f. Some minor restrictions on equated 
variables ; 

g. Constants — 

Reals : 8 digits, with magnitude within 
range 10—^2 iq32 qj. with zero mag- 
nitude, 

Integers : 10 digits ; 

h. Array names must appear in a DI- 
MENSION statement prior to any 
other appearance ; 

i. Maximum I/O record size is 133 char- 
acters ; 

j. Array names used as arguments must 
be declared in COMMON. 

EXAMPLES 

A program exhibiting many of the features 
available in this system is depicted in Figures 
1.5 and 1.6. Figure 1.5a shows the final, cor- 
rect version of the program. Figure 1.5b shows 
the correct output produced as a result of execu- 
tion (see START statement, Figure 1.5a, line 
128). 
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CeHNANO 

101 -READY C THIS IS A SAMPLE PRBGRAM . 

101 -READY C 

101 -ready PR0GRAM SAMPLE 

102 «EADY DIMENSI0N ZPL0T(52), TABLE (500) 

1013 -IREADY X = 0 

104 -tREADY Y = 1, 

loi «EADY READ *101, DELX ,CHAR ,ZPL0T 

107 -IREADY 101 FORMAT (F 7. 4,53A 1 ) 

108 -WEADY PRINT 102 

109 -WEADY 102 FORMAT (X1HX7X1HY) 

110 -JREADY 2 TAK.E(I ) = X 

111 4READY TABLE<I+1)=Y 

112 -WEADY 1 PRINT 103, X, Y 

113 -tREADY 103 FORMAT (atF7.4,F8.5) 

114 -WEADY IF(X-1,)5,3,3 

115 -tREADY 5 1 = I -t- 2 

116 -tREADY X = X -t- DELX 

117 -tREADY DELY = X * Y ♦ DELX 

118 -tREADY Y = Y -t- DELY 

119 -tREADY 60T0 2 

120 -tREADY 3 DO 4 J : !• I , 2 

121 -tREADY X = Table cj) 

122 -tREADY K sl.-KaABLE CJ-t-D-TABLE (2))/aABLE a -t-l)-TABLE (2) )*50. ) 

123 -tREADY ZPLOT (K > = CHAR 

124 -tREADY PRINT 101, X, ZPLOT 

125 -tREADY 4 ZPLOT CK ) =ZPL0T (K-t-l ) 

126 -IREADY STOP 77 

127 -IREADY END 

128 -IREADY START 0 


Figure 1.5a. Sample Program: Final Form. 


106 

=1101 

00.0625* 


108 

^102 

X 

Y 

112 

=0103 

0. 

1.00000 

112 

=0103 

0.0625 

1.00391 

112 

=0103 

0.1250 

1.01175 

112 

=0 103 

0. 1875 

1.02361 

112 

=0103 

0.2500 

1.03960 

112 

=0103 

0.3125 

1.05990 

112 

=0103 

0.3750 

1.08475 

112 

=0103 

0.4375 

1.11441 

112 

^103 

0.5000 

1.14923 

112 

=0103 

0.5625 

1.18963 

112 

:«103 

0.6250 

1.23610 

112 

=0103 

0.6875 

1.28922 

112 

=0103 

0.7500 

1.34965 

112 

=0103 

0.8125 

1.41819 

112 

=0103 

0.8750 

1.49574 

112 

=0103 

0.9375 

1.58339 

112 

=0103 

1.0000 

1.68235 

124 

=0101 

0. ♦ 


124 

=0101 

0.0625* 


124 

=0101 

0.1250* 


124 

=0101 

0.1875 * 


124 

=0101 

0.2500 * 


124 

^101 

0.3125 

* 

124 

^101 

0.3750 

* 

124 

=0101 

0.4375 

* 

124 

=0 101 

0.5000 

* 

124 

=0101 

0.5625 


124 

4101 

0.6250 


124 

=0101 

0,6875 


124 

=0101 

0.7500 


124 

=0101 

0.8125 


124 

4101 

0.8750 


124 

4101 

0,9375 


124 

4101 

1.0000 


126 

=S77 



129 -IREADY 

UNL0AD 


Figure 1.5b. Sample Program: Final Execution. 


Figure 1.6 depicts a preliminary attempt to 
create and test this program. (All references 
that follow are to Figure 1.6.) 

Input to the system may be from the key- 
board or card reader at the remote terminals, 
or through input equipment located at the cen- 
tral computer. At line 106 a mispunched card 
causes printing of an error message. The user 
now suspends automatic input, substitutes a 
correct statement via keyboard, and then re- 
sumes automatic input. Of course, the substitu- 
tion could have been made later by means of an 
ALTER (see below). 


At lines 119 and 120, the user initiates inter- 
mediate execution and verifies his FORMAT 
statements before going further. In this man- 
ner, any statement, sequence of statements, DO 
loop, etc., may be debugged as the program is 
entered; or sections may be tested independ- 
ently of the remainder of the program. 

Execution of the entire program, line 140, 
discloses a number of bugs. Inspection of line 
137 discloses the use of K as subscript. K could 
be printed selectively, but the user decides to 
dump all variables (see line 141). After DUMP 
starts, he interrupts it in order to change the 
format of the display and then dumps again 
(see line 143). In the event that the dump 
showing K =; 51 is not a sufficient clue to the 
error, the user establishes a TRACE on K and a 
TRAP on the entire program, and starts again 
(see lines 144-146). This produces, together 
with his programmed output lines, a dynamic 
listing of control and data flow, before termi- 
nating with the same error message. 

At line 147, the statements in error are cor- 
rected, but a statement number is inadvertently 
omitted. On terminating the ALTER status, 

p nnckoqpnrp i o tm-i n+cirl 'rkriinfi'n nr rkiif +Vi€i+‘ "Hio 

at line 128 references a nonexistent label. This 
error is corrected and a subsequent running of 
the program, line 151, shows that the subscript 
is now behaving properly. 

There are other changes to be made, however. 
The NUMBER at line 152 yields a clean, re- 
numbered listing of the current state of the 
program. Line 231 shows a complete INDEX, 
and line 232, the results of a CHECK statement. 
All of these will be helpful in reorganizing and 
documenting the final, correct version of the 
program. 

Figure 1.7 shows the immediate evaluation 
of arithmetic expressions, consisting solely of 
constants and FORTRAN functions, in the 
Command mode. 

EXPERIENCE 

The system, from its most primitive form to 
the present, has been running for more than a 
year. A formal tryout of the system was run 
in November 1963 with 10 students attending 
IBM's Systems Research Institute. 
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CeNMARD 

lOI -READY Y= 2 . 506 $«IO.**(lO.-fl.> 4 EXPr(*tO.> 

101 = Y= 0,ll37948SE 08 

101 -READY HEMRY = 2.E-9*50,*<L86F(2.*50./10.J-UO+10./50. 

fOl -ERROR 04117. PARENTHESES NOT IN BALANCE. 

101 -READY HENRY =2.E -9*50.*a.0QF (2.*50./10. )-l.+l0./50. ) 

101 = HENRY: 0. 1902389<C -06 

101 -READY R80T1 = (-29.4SORTF(23.**2-4.*1.«2.>>/<2.«I> 

101 -ERROR ARITHMETIC DECOMPOSITION ERRORS) 

101 -ERROR MIXED RODE 

101 -ready RO0T1:(-25--«ORTF(25.**2-4.*1.*2.))/(2.*1.) 

101 = ROOT lr-0. 8025 7654E -01 

101 -READY HENRY=2J:-9^50.*<LOeF<2.*50./10.)-1.0+10./50. ) 

101 5 HENRY: 0. l502585tE-06 

101 -READY VAL:1./C0SF(5O.)**LOGF(ABSF SIHF(50./2.)>C0SF(50./2,)>) 

101 : VAL=-0.977I499«E 00 

101 -READY AREA:2.*10.*5.46INF (3.1416/10.) 

101 : AREA= 0.30901 7688 02 

101 -READY ARC:2.'t«QRTF<4.**2+1.3333*2.**2) 

101 s ARC: 0.92375753E 0! 

101 -READY ARC=2.*(4.*4.+4.*2.*2./3.)**0.5 

tOI : ARC: 0.9237604IE 01 

lOI -READY S:-C0SF(40.)4*(20.+l.)/20./l.) 

101 -ERROR 04117. PARENTHESES NOT IN BALANCE. 

101 -READY S:0.9^06F((1.-»SINF<45.))/(I.-SINF<49.))) 

101 : 0= 0. 12594177E 01 

101 -READY SSINF{49.) 

101 = S: 0.35090352E 00 

101 -READY 6:0.5<L06F(<1.+,7071)/(1.-.7071)) 

101 : 6= 0.88I3599SE 00 

101 -READY E:20.*ATANF(20./4.)-4./2.tLOGFC4.**2+20.**2) 

101 : E: 0.15406644E 02 

101 -READY 0:<2./(3. 1416*10. ))**0.5«INF< 10. ) 

101 = 0:-0.13726397E-00 

101 -ready 0:0.7978/SQRTF(IO.)«INF<10.) 

101 : 0:-0. 1372491 BE -00 

101 -READY C 
101 -READY 


the 7090 before you obtained correct re- 
sults?'' (Number of runs were summed.) 

4. “How many problems did you debug on 
the 7090?" 

5a. “Approximately how many hours of 
training did you have on the terminal 
console ?" 

5b. “How many debugging hours?" 

6. “How many problems did you debug on 
the terminal console?" 

Because this experiment was of limited scope, 
the experience reported must be taken cau- 
tiously. There are many variables which affect 
the usefulness and economy of this approach, 
and continuing field trials will yield more pre- 
cise information. 


Figure 1.7. Examples of Command Mode Operation. 


USER 

BACKGROUND 

SKILLS 

SRI EXPERIMENT | 

7090 FORTRAN 

REMOTE COMPUTING 

FORTRAN 

TYPING 

NO. RUNS 

NO. PROGRAMS 
DEBUGGED 

NO. HI 

TRNG 

OURS 

DBUG 

NO PROGRAMS 
DEBUGGED 

A 

HIGH 

LOW 

Norc 

NONE 

2 

2 

3 

8 

HIGH 

LOW 

5 

3 

2/3 

1 1/3 

2 

C 

HIGH 

NO*C 

NONE 

NONE 

2 

3 

3 

0 

MEDIUM 

HIGH 

12 

5 

1 

6 

4 

E 

MEDIUM 

HIGH 

6 

2 

3 

3 

2 

F 

uya 

MEDIUM 

NONE 

NONE 

2 

6 

5 

G 

LOW 

NONE 

10 

3 

2 

4 

2 

H 

LOW 

NONE 

4 

2 

2 

1 

2 

I 

LOW 

NONE 

3 


n/* 

FI/2 

3 

TOTALS 



40 

1 

16 

16 

20 

26 


Figure 1.8. Results of SRI Tryout. 


The students were divided into two groups, 
I and II, and given the same set of problems 
to be solved in FORTRAN. Group I was told 
to do the odd-numbered problems on the IBM 
7090 and the even-numbered ones on the 
remote-computing terminals. Group II re- 
versed this polarity. 

The chart in Figure 1.8 shows the answers 
given by nine of the participants (the tenth 
failed to return his questionnaire) to the fol- 
lowing questions : 

1. “How much FORTRAN experience have 
you had?" (Answer was evaluated HI, 
LO, MED, NONE.) 

2. “Have you had any typing experience?" 
(Answer was evaluated HI, LO, MED, 
NONE.) 

3. “How many times did each problem go to 


SUMMARY 

The time-shared use of a computer provides 
a convenient, economical service to numerous 
remote users. This access is enhanced by use of 
conversational, source-language debugging tech- 
niques. Although the experimental system is 
oriented to the IBM 1050 terminal, the FOR- 
TRAN language, and scientific applications, the 
techniques described are useful with other ter- 
minal devices, programming languages, and 
application areas. Preliminary operating ex- 
perience indicates that systems such as the one 
described have considerable potential in en- 
abling personnel less skilled in the program- 
ming art to rapidly obtain solutions to their 
problems. 
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REMOTE COMPUTING -AN EXPERIMENTAL SYSTEM 

Part 2; Internal Design 

/. M. Keller, E. C. Strum, and G, H. Yang 
Development Laboratory, Data Systems Division 
IBM Corporation 
New York, N. Y. 


INTRODUCTION 

This is the second of two papers dealing with 
the experimental remote-computing system. 
Part 1 described the system as viewed by a user 
who is unaware that he is jointly sharing the 
central computer with numerous other users. 
This paper (Part 2) describes the internal de- 
sign of the system, with attention focused on 
those features which are of general interest 
and applicable to the design of other program- 
ming systems. 

This paper is introduced by a description of 
the over-all control structure and data organi- 
zation. Each of the principal subsystems is 
then described. The paper concludes with some 
remarks regarding possible extended applica- 
tions. An appendix describes in some detail the 
algorithms used in the decomposition/recom- 
position of arithmetic expressions. 

OBJECTIVES 

An operating system servicing numerous on- 
line users must meet certain design objectives 
that might be regarded as secondary or even 
unnecessary in conventional operating systems 
or compilers. But these objectives become para- 
mount when the psychological and practical 
effects of sustained, immediate access to a com- 
puter are considered. Thus primary attention 
must be given to attaining : 


1. Immediate error diagnostics ; 

2. Program alteration without recompiling; 

3. Extensive symbolic debugging aids ; 

4. Ready availability of the source version 
of the user program ; 

5. A user program that is : 

a. dynamically relocatable, 

b. easily interruptible, and 

c. storage protected. 

SYSTEM ORGANIZATION 
Programs 

The experimental remote-computing pro- 
gram is divided into three major system, areas 
(Figure 2.1) : 

1. The Scheduler, which is responsible for 
maintaining awareness of the total sys- 
tem status and for ordering and assigning 
tasks to the other system parts ; 

2. The Process Control system, consisting 
of the Translator, which reduces the 
user’s input statements to an equivalent 
internal form (see below) ; the Inter- 
preter, which executes this internal form ; 
and the Process Control program, which 
regulates these two subsystems on the 
local level ; 

3. The I/O Control system, which is respon- 
sible for monitoring and operating all 
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Figure 2.1. General Block Diagram of System. 

I/O attachments, including the communi- 
cations exchange. 

Data Organization 

At the system level, there are three principal 
data constructs : 


CURRENT STATUS INFORMATION 

DATA OUTPUT FORMATS 

RECORD OF SYSTEM USE 

CURRENT USER I.D. 

CURRENT ACTIVE PROGRAM 

ACTIVE PROGRAM ADDRESS 

CALLED SUBPROGRAM 

j 

Figure 2.2. Terminal Header. 


The Terminal Header 

For each terminal in the system, there is a 
Terminal Header record (Figure 2.2) contain- 
ing the following information : 

1. Current status : 

a. operating mode, i.e., Command or 
Program (Rf. Part 1), 

b. terminal status, i.e., I/O wait, busy, 
or dormant, 

c. control information, i.e., should the 
system interrupt automatic status 
(execution) and return to manual 
status (statement entry) or continue 
automatic status, 

d. type of terminal component active, 

e. terminal ID, 

f . storage allocation block ; 

2. Header information for Command mode 
execution : 

a. formats for data output, 

b. system use records ; 

3. Temporary locations for random storage 
access : 

a. current user identification, 

b. name of current active program, 

c. location of active program for this 
terminal, 

d. name or location of subprogram called 
by current program. 

The Master Block 

For each statement in the language (Rf. Part 
1), there is a Master Block record (Figure 2.3) 
containing the following information : 

1. A statement type identifier ; 

2. A statement class identifier ; 

3. The symbolic, external statement identi- 
fier with associated control characters for 
recognizing the statement name on input 
and for recreating it on output ; 

4. Various indicators which denote intrinsic 
statement characteristics for checking 
purposes ; 

5. Addresses for transfers of control to the 
various major system routines, e.g.. 
Translator, Interpreter, etc. 
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INDICATORS 

CLASS 

CODE 

STATEMENT CODE 

CONTROL 

CHARACTERS 

AND 

SYMBOLIC 

STATEMENT 

IDENTIFIER 

1 

SCAN 

ADDRESS 


LIST 

ADDRESS 

1 

LINK 

ADDRESS 

1 

INTERPRETER 

ADDRESS 

1 

PROCESS CONTROL 
ADDRESS 

1 

PROCESS CONTROL 
ADDRESS 


PROCESS CONTROL 
ADDRESS 




Figure 2.3. Master Block Record. 


The Master Block is used either as a diction- 
ary, when information concerning the state- 
ment is needed, or as a switching center, when 
control flow within the system is dependent 
upon the statement type. A single record for 
both of these activities provides considerable 
flexibility in adding new statements, in modify- 
ing control conditions, and in making basic 
system modifications. 

The User Program Layout 

For the entire system there are two large, 
fixed areas (Figure 2.4) reserved for occupa- 
tion of the various active user programs, Pro- 
grams brought into these areas are relocated 
under program control; all I/O to and from 
these areas is overlapped. The duration of oc- 
cupancy is determined either by overstepping 
a time limit or by the occurrence of one of 
several specific conditions (see following sec- 
tion on user-program organization). 

The layout of the user program is divided 
into two parts : 

1. The statement and element records (see 
following section) which comprise the 
user program ; and 

2. The header. 



Figure 2.4. User- Program Layout. 


The header is further subdivided into two 
parts : 

1. List control and other controls for the 
program records (see following section) ; 
and 

2. Control words used by the Process Con- 
trol program to keep track of program 
status. 

USER-PROGRAM ORGANIZATION 
The User Program 

The user's source-program statements are 
mapped into equivalent internal records, which 
are classified and controlled by list structures. 
These records and their controlling elements 
constitute the user's program (see Figure 2.4). 
Every statement of the user's program is re- 
duced to an individual statement record; and 
every element (name or label) is reduced to an 
individual element record. These records are 
inserted and chained on lists in the program 
area in the order of their appearance and crea- 
tion. Control is maintained through tables of 
list-control words in the header portion of the 
user program. All addresses in the user pro- 
gram are relative to its base in order to facili- 
tate relocation. 
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Records 

Element Records 

An element in the source program is defined 
as a label, constant, variable, array, or func- 
tion name. Every source element maps into a 
fixed-length, internal element record (see Fig- 
ure 2.5) containing the following information: 


REFERENCE 

NUMBER 

TYPE 

MODE 

SIZE 

NEXT ADDRESS 


1 

INDICATORS 

ARRAY/COMMON/EQU 1 VALE NC E 
ADDRESS 

NAME 

VALUE 


Figure 2.5. Element Record. 


1. Reference number — a unique internal nu- 
meric identifier, assigned by the system. It 
is used for all internal referencing by the 
system. 

2. Type — denotes the type of element, i.e., 
label, constant, variable, array, function. 

3. Mode — denotes the mode, real or integer, 
of elements referring to numeric quanti- 
ties. 

4. Indicators — contains properties attributed 
to the element by declarative statements 
and/or execution. These include storage- 
allocation, and indications of element usage 
at object time. 

5. Name — ^the external alphanumeric identi- 
fier. 

6. Value — either the numeric value of the 
element or supplemental information for 
an array or function. 

7. Next address — ^address of the next element 
record. 

8. Array COMMON EQUIVALENCE— 
address of the value, if the element is in 
COMMON or is an array; or an offset ad- 
dress, if the element is equated to an array. 

Statement Records 

Every source statement maps into a variable- 
length, internal statement record (see Figure 
2.6), which contains in coded form all informa- 


ALTER NUMBER 

SIZE 

NEXT ADDRESS 

STATEMENT 

CODE 

INDICATORS 

LABEL 

NEXT CLASS ADDRESS 

R(C) 




* 

R(A) 


R(B) 

PARAMETER 

OPERATOR 

R(D) 



FUNCTION 

OPERATOR 

R(SQRT) 



/ 

R(C) 


temp 

+ 

temp 


temp 

— 

R(C) 


temp 


C = A*B+ C/SQRT(D) 

Figure 2.6. Statement Record. 


tion present in the source statement. Each 
record begins with two standard words con- 
taining the following information : 

1. Alter number — a unique internal numeric 
identifier assigned by the system. It de- 
notes the position of the statement relative 
to all others in the program; it is refer- 
enced by the user when modifying the pro- 
gram, manually requesting information, or 
starting execution, 

2. Statement code — identifier of the partic- 

ular statement type. 

3. Indicators — reflects usage of the state- 
ment during execution. 

4. Label — refers to the associated external 
statement number, if any. 

5. Next address — ^address of the next state- 
ment record. 

6. Next class address — address of the next 
statement record of the same type. 

The remainder of each statement record con- 
tains one or more words. Their number and 
content depend on the particular statement 
type. For example, an arithmetic-statement 
record contains the macro representation of 
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the translated expression, while a DO state- 
ment contains references to the indexing param- 
eters. 

Lists 

The objective of providing for alteration of 
individual statements was the deciding factor 
in determining the internal record organiza- 
tion. The conventional table-oriented approach 
appeared much less attractive than the classi- 
fication of records by lists, 2 . 3 , 4, 5 , e 

Most compilers use tables to record infor- 
mation necessary for referencing and validating 
data usage and control flow. In this system, the 
same information is kept in the statement and 
element records. However, organizing these 
records on lists allows for increased flexibility 
in the compiling system.'^* For example, 

deletion and insertion of statements for pro- 
gram modification is easily provided. Time-con- 
suming recompilations become completely un- 
necessary as a result of this altering provision. 
In addition, errors resulting from improper 
control flow and from invalid variable refer- 
ences can be diagnosed earlier in the compila- 
tion process than is common with conventional 
compilers. 

Element Lists 

Each element record is chained onto one of 
26 element lists, each list consisting of all 
those element records whose symbolic names 
have the same initial letter. Element records 
within each list are ordered alphabetically by 
symbolic name. There are two additional lists 
which link numeric elements as either integer 
or real constants. This set of element lists pro- 
vides two significant features : 

1. The symbol look-up is more efficient since 
only the set of symbols with the same 
initial letter are considered ; 

2. Fully alphabetized symbolic cross-refer- 
ence listings and memory dumps are easily 
provided. 

Statement Lists 

Each statement record is chained onto two 
lists : 

1. The entry list consisting of all statement 


records in source sequence (i.e., ordered 
by “alter number”) ; 

2. One of the class lists, consisting of all 
statements of a particular class (e.g., 
arithmetic, control, DO, I/O, allocation 
declarations, etc. ) . 

The entry list is used both to control execu- 
tion sequence and to provide the proper order- 
ing when the source program is reconstructed 
from the internal form. 

The class lists are extremely useful in per- 
forming checking operations (e.g., checking DO 
loops for proper nesting and control transfers). 

List Control 

Every list is controlled by a single control 
word pointing to the first and last records. For 
the statement lists there is a small table of 
control words for statement control (see Figure 
2.4) . Another similar table controls the element 
lists. In addition there is also a master table 
controlling the symbolic names of reserved 
system symbols: library functions (e.g., SIN, 
SORT, etc.) ; built-in functions (ABS, FLOAT, 
etc.); and system subroutines (DUMP, EXIT, 
etc.). 

Addressing 

Two tables exist for control of the element 
and label identifiers (see Figure 2.4). These are 
the R-index, or internal-identifier reference 
table, and the N-index, or numeric-label table. 
For every element that appears in a program, 
an entry for its internal identifier, R, is made 
in the R-index table ; similarly, for every state- 
ment label, N, an entry is made in the N-index 
table. 

Every element or statement in the program 
can be accessed in an “asociative” manner by 
sequentially searching the lists until a match 
is found for the requested symbolic name or 
alter number. Each element or labeled state- 
ment can also be located in a “direct-look-at” 
manner^i by using the internal identifier for the 
element or label as an entry to the R- or N- 
index table. Thus the flexibility of associative 
list searching and the efficiency of direct ele- 
ment fetching are both incorporated in the 
system. 
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FUNCTIONAL DESCRIPTION 
The Scheduler 

The purpose of any real-time, multiprogram- 
ming supervisory program is to synchronize, 
control, and monitor system operation.^^. is. u 
The program is responsible for determining 
what things are to be done, and by whom, to 
what, where, and when each is to be done. It 
has the duty of maximizing system through- 
put and ensuring reliable operation, and, in this 
case, of maintaining rapid and level response 
times at the terminal consoles. 

At the nucleus of this supervisory structure 
(see Figure 2.1) is the Scheduler,!^,!® which con- 
trols the : 

1. Process Control program, which in turn 
directs the processor routines that trans- 
late and execute user programs ; and 

2. I/O Control program, which coordinates 
the communications exchange, random 
storage devices, tape units, reader, and on- 
line printer. 

The Scheduler performs continual sequential 
sampling of the subsidiary subsystems and 
maintains pertinent status data in the termi- 
nal headers. When data has been received from 
the terminal, the Scheduler examines the 
terminal header and decides whether to trans- 
mit a request to the random-storage I/O queue 
to fetch the user program (Program mode), or, 
if no program is required, save the data in a 
to-be-processed queue (Command mode). 

In either event, the Scheduler passes to the 
Process Control program all information neces- 
sary for processing the input message — such as 
locations of the terminal and program headers, 
the location of the input message, and the op- 
erating mode of the terminal. 

Even if no message has been received for a 
given terminal, its active program will be 
fetched from random storage and the Process 
Control program entered, if the terminal header 
shows that the program is in the automatic 
state (i.e., in the process of execution). After 
each return from the Process Control program 
under this condition, the Scheduler must deter- 
mine whether the automatic state should be 
terminated. 


There are two kinds of termination : tempo- 
rary, and return-to-manual. Temporary inter- 
ruption frees the system for use by another 
terminal and may occur for the following 
reasons : 

1. The allotted time interval has expired ; 

2. Input data is requested : 

3. Output buffer is filled ; 

4. An external subprogram is invoked. 

The return to manual status occurs when : 

1. An error condition occurs ; 

2. A STOP or PAUSE statement is executed ; 

3. The end of the program is encountered ; 

4. The user requests an interrupt from the 
terminal. 

The Process Control Program 

The Process Control program (see Figure 
2.7a) accepts information from the Scheduler 
and coordinates the activities of the processor 
programs. All of the appropriate Process Con- 
trol routines and service routines must be ini- 
tialized (1) to process the terminal header if 
the termnial is in the Command mode, or (2) to 
process various parts of the program header 
and list if in the Program mode. The Process 
Control program maintains an action code in 



Figure 2.7a. General Diagram of Process Control Flow 
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each header to determine the next task to be 
performed upon the user program. The 
Process Control program may require the 
input statement just received to be scanned by 
the translator; it may require continuation of 
execution, of a DUMP, or of a LIST. The user 
program may be in the ALTER mode; it may 
be in the process of being tested for certain 
conditions which may prohibit further execu- 
tion. Data for an input statement may be 
awaited or output of multiple-error messages 
may be in effect. 

When the user program is in the manual 
mode, the Process Control program has the 
responsibility of examining the process codes 
returned from the translator and of taking the 
necessary action. When the user program is 
in the automatic mode, execution may be 
temporarily halted and, in some cases, the pro- 
gram may be returned to manual-mode status 
(see above). When a subprogram ‘‘cair^ is 
made, execution halts until the next cycle for 
this terminal. At that time, the called subpro- 
gram becomes the user’s active program and is 
brought into memory in place of the calling pro- 
gram. When a RETURN is effected or if an 
error occurs, the calling program is reactivated. 



Figure 2.7b. General Diagram of Process Control Flow 
— Translator. 


In order that the user may always be aware 
of the status of his program, condition codes are 
printed at the terminal whenever a change of 
status occurs. He is notified when input (a 
statement or data) is requested; when an error 
occurs; when and why execution was termi- 
nated; when a system statement (see Figure 
2.7d) occurs (e.g., DUMP, INDEX, TRACE); 
and, optionally, when a subprogram call is 
made. When execution runs off the end of a 
program, and when a STOP or PAUSE is en- 
countered, he is informed that his request for 
interruption of execution has been recognized. 
In short, the Process Control system always 
knows what is happening in the user pro- 
gram, and continually keeps the user informed 
of the status of his job. The objective is to 
provide the remote user with a more complete 
awareness of his program’s status than is 
obtainable at a conventional computer console. 

The Translator 

Scan Routines 

The Translator (see Figure 2.7b) is responsi- 
ble for transforming the source-language pro- 
gram to the internal form.^'^* (See Figures 
2.5 and 2.6.) A preliminary scan is first used 
to identify arithmetic statements. For all 
other statements, the statement operator is col- 
lected and used to reference (via a Master Block 
routine) the corresponding master record. Con- 
trol then passes to the translation routine for 
the particular statement type, e.g., GOTO, RE- 
TURN, PRINT, DIMENSION, etc., 

Every statement’s decomposition goes 
through the same basic phases to form element 
and statement records. These involve the use 
of several service routines to collect the element 
name, find its record in a list or create a new 
record, and validate the statement and element 
usage. 

As each element in a statement is collected, 
a search is made to determine if it has pre- 
viously appeared in the program. If the ele- 
ment has been previously used in the same 
statement, the record appears on a current ele- 
ment working list; otherwise, it may be found 
on the element list in the user’s program or, 
alternately, on the master list of reserved and 
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system names. If no record is found, then the 
element is new to the program, and a record is 
created for it and placed on the current work- 
ing list. The information put into the record 
depends on the variable itself and the kind of 
statement it appears in. The mode indicators 
for an element depend either on its initial 
letter or on its appearance in an INTEGER 
or REAL declarative statement. The type and 
variable indicators depend on the statement 
type. A variable appearing in a storage-alloca- 
tion statement is nagged according to its dec- 
laration as an array, common, or equated vari- 
able. 

If all source elements have previously ap- 
peared, no new element records result from 
the translation of a statement. However, a 
statement record must always be created. The 
Master Block record for the statement type 
provides the statement code. Statements that in- 
volve a list of variable, such as DIMENSION, 
EQUIVALENCE, or COMMON, contain a 
count of the variables used followed by the R- 
number of the variables. 

Statements which involve a list of labels, such 
as: 

GO TO (5, 6, 7, 8), I or IF (J-5) 12, 3, 12 

contain an item count followed by the numeric 
labels. If the execution of a statement will 
change the value of a variable, the identifier of 
that variable is placed in a special field in the 
statement (see Figure 2.6). 

Statements containing arithmetic expres- 
sions or input/output lists involve specialized 
decomposition routines. The master Trans- 
lator passes to both these routines essentially 
the same input: a string of words, each word 
containing either an operator or an R-number. 
The decomposition routine transforms these 
elements into an ordered set of arithmetic 
macros consisting of an operator and two op- 
erands in every word. These macros are then 
returned to the master Translator and added 
to the statement record (see Figure 2.6). The 
input/output list-decomposition routine also 
returns a macro set of executable operations, 
(A detailed description of the arithmetic de- 
composition is contained in Appendix I.) 


Scan Diagnostics 

Throughout the translation-scan phase, 
checking occurs for syntax and composition- 
type errors. Illegal statement operators and 
invalid statement forms are detected early in 
the translation. Lack of a label on a FORMAT 
statement or the presence of a label on a de- 
clarative (where control may not flow) violate 
the definition of the statement type. Illegal 
uses of variables, such as a simple variable 
name followed by a parenthesis, are detected 
by testing indicator bits in the element records. 
The same checking of element records is used 
to detect mixed-mode errors in the arithmetic 
expressions. The number of subscripts fol- 
lowing an array name is checked for agree- 
ment with the number declared in the DIMEN- 
SION statement for that variable. In general, 
the Translator detects all syntactic errors 
which are within the context of a single state- 
ment and those semantic errors which occur in 
the use of the elements in the statement. 

Link Routines 

If the statement has no errors, the Process 
Control program decides whether to save the 
statement record and its related element records 
as part of the user's program. A statement 
record is either added to the end of the entry 
and class lists or, if the ALTER mode is active, 
is inserted somewhere into these lists. To ac- 
complish this linking, space for the new record 
is found, and the address of this area, relative 
to the program area base, is inserted as the 
'‘next" address in the preceding record on the 
list. 

If the statement record is successfully put 
into the program area, the element records are 
linked to their respective lists. Every new ele- 
ment record also causes its address (relative to 
the user program base) to be entered into the 
R-index table. 

Link Diagnostics 

Before a new record is actually chained to a 
list, certain checks for consistency of referenc- 
ing are made. These are partially accomplished 
through use of the N-index table, in which all 
references to labels are recorded. For every 
label in the program there is a corresponding 
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entry in this table. In each entry there are 
two fields : the first specifies the relative address 
of the labeled statement record; the second 
specifies how the label is referenced, i.e., from 
an I/O statement, a DO statement, or a branch 
type statement. These entries are set up and 
checked before the statement is linked to the 
lists. 

Examples of the errors detected at this 
phase are : 

1. Duplication of statement numbers ; 

2. Referring to a FORMAT statement from 
a branch statement ; 

3. Referring to an executable statement from 
an I/O statement; 

4. Using an illegal statement as the end of a 
DO (e.g., a branch type) ; 

5. Referring, as the end of a DO, to a state- 
ment which precedes the DO. 

Another type of consistency error detected 
at this time is based on ordering of statements. 
To link a statement into a list, the preceding 
statement must be available. In the case of an 
ALTER insertion, the succeeding statement 
is also available. It is possible, then, to check 
for violation of such precedence rules as: 

1. Declarative statements must precede ex- 
ecutable ones ; 

2. The first executable statement following a 
branch-type statement must be numbered 
(i.e., every section of the program should 
be potentially executable). 

It is important to note that all these consist- 
ency and precedence errors are reported to 
the user immediately after the statement is 
accepted by the system. Most conventional com- 
pilers report all composition-type errors 
throughout the entire program before going 
on to check for consistency errors. In this sys- 
tem, diagnostics are provided as early as pos- 
sible. 

When the END statement is first linked to 
the program list, or thereafter at the end of an 
ALTER sequence, several specialized routines 
check completeness of control flow and data 
referencing. 


Storage Assignment 

The value of a simple variable or a constant 
is stored in the element record. However, stor- 
age for all arrays and any variable appearing in 
common must be specially assigned. Because 
of their interaction, all allocation declarations 
must be entered before storage can be assigned ; 
on the other hand, storage must be assigned as 
soon as possible since partial execution of the 
program may be requested at any time. The 
Link routine, on recognizing the first executable 
statement, assigns storage on the basis of all 
declarative statements, which are linked on the 
same class list. After an ALTER sequence in- 
volving a storage allocation the same operation 
is again performed. 

Storage Control 

When a statement or element record is to be 
linked to a list, it is moved from a temporary 
working area to the program area. Space for 
successive records or data storage is at first 
assigned sequentially throughout the program 
area. 

When the user deletes (via ALTER) any 
statement or variable from the program, the 
associated records are unlinked from the pro- 
gram and chained to a “null” list ordered by 
size of record. When space is needed for a new 
record, the null record that best fits (i.e., large 
enough but with minimal “trim”) is selected; 
this technique prevents wasteful fragmentation 
of the null-storage areas. 

If no record on the null list satisfies the space 
requirement, but the total size of the scattered 
null records would provide enough space, then 
a “squeeze” is performed by moving every 
record in the program to a contiguous storage 
area. All references to relative addresses in the 
program area are then changed to reflect this 
relocation. 

The Interpreter 

Execution of the user’s program is done in 
an interpretive fashion^**^!’^^-^*-^^ ojj a state- 
ment-by-statement basis under control of the 
Process Control program. This control pro- 
gram sends to the Interpreter the address of 
the statement to be executed. Upon successful 
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Figure 2.7c. General Diagram of Process Control Flow 
— Interpreter. 

execution of this statement, the relative address 
of the next statement is saved in the program 
header. At this time, an indicator is turned on 
in the statement record, showing that the 
statement has been executed. 

The Interpreter can be broken into several 
parts (see Figure 2.7c) : 

1 . The master interpreter, which decodes the 
statement type ; 

2. The service subroutines used by all state- 
ment routines ; 

3. The macro interpreter used for arithmetic 
expressions ; 

4. The various statement routines. 

Decoder 

To interpret a statement, a code is fetched 
from the statement record and, using the Mas- 
ter Block, control is transferred to the appro- 
priate Interpreter routine. 

Service Routines 

These subroutines are used to fetch element 
and statement records and to address value 
words for variables and constants. In the In- 



Figure 2.7d, General Diagram of Process Control Flow 
— System Statements. 

terpreter, all fetching is done by a direct ‘‘look- 
af' of a table entry for an address. This is in 
contrast to the associative referencing used in 
the Translator. Execution speed is consider- 
ably increased by this elimination of list search- 
ing. 

All references to a variable in a statement 
record are by its internal identifier. This num- 
ber is used as a key to the R-index table to 
access the relative address of the element rec- 
ord. Whenever a value word is fetched for 
use, an indicator for 'Variable used'' is turned 
on in the element record. Similarly, if a value 
is stored into a value word, an indicator for 
"variable set" is turned on. 

Macro Interpreter 

The evaluation of an arithmetic expression 
can be expressed in hardware terms; i.e., the 
system has an instruction repertoire of six 
two-address instructions, and is equipped 
with a group of pushdown registers.^®* 27 , 2 s, 23 
Execution of the statement in the interpretive 
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mode is analogous to machine execution and 
involves several steps : 

1. Fetch the next niacro (i.e., instruction) to 
be executed ; 

2. Fetch operand values ; 

3. Decode the instruction operation ; 

4. Perform the specified operation ; 

5. Store the result in a push-down stack. 

Subscripts of arrays and arguments of func- 
tions are indicated by a special operator. When 
this operation is encountered, an entry is made 
in a push-down parameter stack. Values are 
fetched from this stack for computing an array 
address or for passing arguments to a function. 

Functions are also indicated by a special op- 
erator. When this operator is encountered, the 
function-element record is fetched. All records 
for library routines point to their actual ma- 
chine coding within the system. All other func- 
tions are called from random storage. 

Statement Routines 

Every statement has a particular Interpreter 
routine associated with it. There are several 
categories which should be discussed : 

1. Arithmetic— the macro interpreter is used 
to evaluate the expression to the right of 
the and store its value in the left- 
hand variable. 

2. Branch — the macro interpreter is used to 
evaluate the arithmetic expression for an 
IF; a service routine is used to fetch the 
value for I in GO TO (. , . .), I. The proper 
transfer point is chosen from the list of 
numeric labels in the statement. This 
numeric label is used to access the N-index 
table for the relative address of the state- 
ment to which control should flow. 

3. DO loops — ^the initial execution of a DO 
statement creates an entry in a push-down 
stack controlling DO nesting ; it also initial- 
izes the value of the DO index variable 
and flags its element record as an active 
DO index. The execution of the last state- 
ment in the range of a DO is detected 
through checking of an indicator turned 
on in the translation process. After execu- 
tion of this last statement, the DO state- 


ment is fetched again and its index is 
tested and incremented. Execution con- 
tinues with the statement following the 
DO until the indexing condition is satisfied. 

4. Input/Output — an I/O macro interpreter 
is used to compute addresses of values to 
be passed to the appropriate input/output 
service routine. A table is generated in 
the execution process to handle variables 
in the list controlled by implied DO’s. 

Execution Diagnostics 

Choosing the interpretive approach to ex- 
ecution necessarily means sacrificing speed. 
For debugging purposes, this is not often a 
serious impediment — especially since diagnos- 
tics are possible for many errors never detected 
in conventional execution. These include de- 
tecting : 

1. A value word not being set before used; 

2. A subscript value not being valid ; 

3. A DO index being reset in the range of 
the DO ; 

4. A computed GO TO parameter not being in 
range ; 

5. The size of an integer exceeding its limits ; 

6. The existence of an illegal value in an I/O 
list with implied DO’s. 

Input-Output Control System (IOCS) 

The prime responsibility of the Input-Output 
Control system is to select, from the respective 
queues built up by the Scheduler, the next task 
or combination of tasks to be performed by the 
individual I/O units. Upon completion of a 
given task, the Scheduler is notified either di- 
rectly through program switch indications or 
indirectly through the terminal header. Before 
relinquishing control to the Scheduler, the 
IOCS initiates the next task for that channel 
device based on the queue information. It also 
maintains control surveillance over all I/O 
buffer areas to prevent overflow. 

The I/O attachments consist of disk, drum, 
magnetic tape, card reader, on-line printer and 
communications exchange. 

The disk is used as a permanent storage for 
user programs. The drum serves as a rapid 
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access storage device for the repeated shuffling 
of user programs in and out of memory. The 
magnetic tapes, card reader, and printer are 
used in a conventional manner. 

The communications exchange has some in- 
teresting capabilities not found in more con- 
ventional I/O equipment. 

The Exchange 

The IBM 7740 communications control sys- 
tem^^'^i is used to buffer and control the traffic 
flow between the communications network and 
the IBM 7040 computer. 

It is a stored-program computer with a 
rather specialized instruction repertoire de- 
signed for real-time applications. The instruc- 
tions possess powerful logic and data manip- 
ulating facilities, through somewhat limited 
arithmetic capability. Instructions are fixed in 
size, one instruction per 32-bit word, while data 
is composed of strings of 8-bit characters. Ad- 
dressing is at the character level, up to a maxi- 
mum of 64K characters (i.e., 16K words). 

The 7740 program performs several com- 
munications-oriented functions. First, it ac- 
complishes line and terminal control by gen- 
eration, recognition, and manipulation of con- 
trol characters, in order to establish a connec- 
tion to the remote terminals, and to determine 
the operation to be performed. Second, it pro- 
vides message control, so that the messages 
may reach their intended destinations : they are 
logged in, monitored for correctness, and con- 
verted from the various transmission codes to 
the codes acceptable to the other devices in use. 
Third, it provides protection to ensure the 
proper disposition of messages, and to ensure 
the correction of transmission errors wherever 
possible. 

To simplify these functions, the 7740 has 
several hardware and programming capabilities 
not often found in conventional computers. 

1. The most striking of these is the ability 
to operate in an independently controlled 
hierarchy of modes. In increasing order 
of priority (that is, decreasing order of in- 
ter ruptability), these are: 

a. The normal mode. The normal activ- 
ities involved in polling, addressing, 


and monitoring of all communications 
devices are conducted in this mode. 
Because of the large number of lines, 
processing is on a continuous service 
basis, whereas a conventional computer 
attains I/O overlap by yielding inde- 
pendent control to the devices and 
servicing them on an interrupt basis. 

b. The I/O mode. This mode is used to 
control input/output between the 7740 
and the 7040. A special uninterruptable 
state, called copy mode, is used for the 
actual transmission of information, 

c. The attention mode. This mode is en- 
tered when service (not connected with 
any hardware malfunction) is needed 
(e.g., servicing the interval timer) . 

d. The service mode. This mode is en- 
tered if malfunctions are detected. 

Entry to the service, attention, or I/O 
copy modes may be initiated by the ma- 
chine ; entry to any mode may also be initi- 
ated by the program. In addition, it is 
possible to inhibit mode change so that 
tables or programs used in several modes 
may be protected (this is analogous to dis- 
abling a channel on a conventional com- 
puter). 

Associated with each mode is a pair 
of machine registers which contain the 
complete status information. Mode change 
is automatically accomplished by storing 
this information into the cells associated 
with the old mode and picking up the cor- 
responding information from the cells as- 
sociated with the new mode. 

2. Time-stamping, essential to control in any 
communications or real-time environment, 
is provided for by the interval timer, which 
is automatically updated by the machine 
every few milliseconds. This timer is used 
in conjunction with attention-mode pro- 
grams to provide a programmed real-time 
clock, and a programmer-accessible in- 
terval timer. 

3. Information about each of the communica- 
tion channels (or lines) is maintained in 
fixed positions of core storage using two 
channel-control words, one pair for each 
line involved. The current status informa- 
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tion of these words is manipulated by both 
the hardware and the programs in order 
to control the flow of information within 
the system. 

4. In order to facilitate the acquisition and 
transmission of data, the memory of the 
7740 is considered by the hardware to be 
divided into blocks of 32 characters, each 
of which begins on an 8-word boundary. 
The first 30 characters of each block are 
used to store data, while the last two pro- 
vide a 16-bit chaining address used to in- 
dicate where the next block of information 
is located. These chain addresses, supplied 
by programming, are used by the hard- 
ware to advance automatically to the next 
character location. 

Because storage is not infinite, it is pos- 
sible to place a special indicator in the 
chain-address location of any block. When 
this buffer-block signal is detected by the 
machine in the process of acquiring a new 
block, automatic entry into the attention 
mode occurs, thus enabling the program 
to accurately control the available storage 
pool. 

CONCLUDING REMARKS 

The Translator described performs a map- 
ping of a source program to an equivalent, list- 
structured, internal form. This method may 
be called “selective’’ or “differential” compil- 
ing, because statements may be inserted, re- 
placed, and deleted without retranslating the 
entire source program. In addition, this ap- 
proach provides rapid, comprehensive refer- 
ence and diagnostic data. And finally, the 
process is reversible; the source program may 
be regenerated in its original form, or in a 
related form .^2 

Interpretive execution provides the means 
for complete source-language debugging. In- 
formation on the dynamic behavior of data use 
and control flow can be applied to improve opti- 
mization of the generated object code. 

The implementation and description of the 
remote-computing system has naturally been 
done in a time-sharing context. Nevertheless, 
the techniques used are equally applicable to a 


conventional compiler operating under a moni- 
tor system. 

Standard hardware devices in a conventional 
configuration were adapted to this purpose 
through programming. However, system per- 
formance could be substantially improved by 
use of a special machine organization designed 
to perform the same functions. 
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APPENDIX I— EXPRESSION DECOMPOSL 
TION/RECOMPOSITION 

Introduction 

The primary purpose of any formula trans- 
lator is to reduce expressions to a form that 
provides the fundamental order in which op- 
erations should be performed to produce cor- 
rect results. Implicit in this form should be 
a record of the order in which partial results 
are developed, accumulated, and reused. 

The techniques and traditional program- 
ming tools generally applied to accomplish this 

"SS, 34, 35, 36 

1. Forward scan ; 

2. Push-down list ; 

3. Forcing tables ; 

4. Ordered macro list ; 

5. Implied push-down temporary indica- 
tions ; 

6. Chaining and string concatenation. 

Forcing tables are used to produce an order of 
operation based on the real or assumed 
hierarchy of arithmetical or mathematical op- 
erators. Push-down lists in this respect often 
work on a LIFO (last in-first out) principle. 
Macros are used as a form which approaches 
as nearly to a machine-executable form as can 
be used while retaining its machine-independ- 
ent structure. In addition to the operator and 
the operand elements, the macro form often 
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contains a reference to the temporary result 
that the operation will produce, such as T1 or 
T2, implying a push-down order to partial re- 
sults. String-manipulation techniques of chain- 
ing and concatenation are often employed to 
facilitate translating operations. 

Requirements 

The Arithmetic Translator includes not only 
the traditional decomposition but also a re- 
composition^^ phase to restore the statement or 
expression to its original form from the com- 
pressed macro string generated during decom- 
position. The macro string generated, there- 
fore, must satisfy several requirements : 

1. It must be easily interpreted, saving time ; 

2. It must be compact, saving space ; 

3. It must be recomposable. 

The decomposition translator must detect 
all errors in logic and syntax. It must supply 
the number, order, and mode of all operations 
to be performed by the Interpreter. 

The recomposition translator should develop 
a string in the original sequence; all necessary 
punctuation must be restored. In short, it 
must produce a string identical in all respects 
to the original, except for the removal of re- 
dundant parentheses. The resulting string, 
when decomposed again, should produce a 
macro string identical to that originally de- 


composed. 

OPERATOR 

DECOMPOSITION 

RECOMPOSITION 

Symbol 

Nome 

Left Op 

Right Op 

Old Op 

New Op 

+ 

plus 

5 

5 

2 

1 

- 

minus 

5 

5 

2 

] 

* 

multiply 

4 

4 

3 

2 

/ 

divide 

4 

4 

3 

2 

.* 

exponentiation 

4 

3 

3 

3 

so 

subscript 

6 

0 

7 

1 

fo 

function 

6 

0 

7 

1 

- 

replacement 

7 

0 

7 

1 

urn 

unary minus 

5 

1 

2 

1 

, 

comma 

6 

5 

0 

1 

( 

left paranthesis 

6 

0 

0 

0 

) 

right paranthesis 

0 

5 

0 

0 

O 

end of message 

0 

7 

0 

1 

Note: 

(EOM) 

Tbe zero code signifies that the operator is illegal when appearing in the 


speclffed role. 


Figure 2.8. Forcing Tables for Translator. 


Techniques 

Forcing Tables 

The forcing tables in Figure 2.8 are used as 
follows. The decomposition table is used to 
cause the* generation of macros based on the 
relative hierarchy of related or successive op- 
erators. If the value for the right operator is 
equal to or greater than the value for the left 
operator, then a macro based on the left opera- 
tor is generated. 

The recomposition table is used to decide 
when parentheses are necessary to maintain the 
hierarchy implicit in the order of macros pre- 
viously generated. If the value for the new op- 
erator is greater than or equal to the value for 
the old operator, then the string developed 
around the old operator during a previous con- 
catenation must be enclosed within parentheses 
before further concatenation can take place. 

Push-Down Lists 

The lists used in the decomposition trans- 
lator are the operator and variable lists which 
hold those elements awaiting further action 
from a forcing situation. The recomposition 
translator has an operator list used essentially 
for the same purpose. In addition, it uses two 
lists which contain control words of partial 
strings awaiting further action. The ‘Vork 
list” contains the control words of strings which 
are to be concatenated into a single string with 
a single control word. This control word is 
then placed on the ''string list” until a later 
call for further concatenation is encountered. 

Macro Strings 

Each macro contains an operator byte and 
one or two variable bytes. The operator is a 
basic operation plus an indication of the modes 
of the variable bytes. Either or both variable 
bytes may contain a temporary indication. 
These do not have to be specific temporary in- 
dications, since owing to the ordered structure 
of the macros, both the Interpreter and the re- 
composition translator use a push-down ac- 
cumulator for storing and fetching partial re- 
sults of execution and partial strings developed 
through concatenation. For the same reason, 
no indication need be kept in the macro of the 
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temporary to be generated by the operator 
(such as T1 or T2). To save space, macros with 
temporary indications may be compressed dur- 
ing packing procedures by indicating left and/ 
or right temporaries in the operator byte, thus 
eliminating all bytes for temporary indications. 
For example, when the macro string generated 
in the example is compressed in this manner, it 
results in 22 bytes, or, at four eight-bit bytes 
per word, less than six words of storage. The 
Interpreter accesses macros — in order — for ex- 
ecution of the statement, building up temporary 
results and using them in turn when later 
macros call for them. An EOM (End of Mes- 
sage) operator signals the end of the macro 
list. The recomposition translator accesses the 
macros and builds up temporary strings in 
much the same manner as the Interpreter. Also 
from this simplified, compact macro form it is 
but a simple step to generate machine-language 
code; either temporary locations can be im- 
plicitly addressed by the machine itself or else 
explicit storage addresses can be used. 

Chaining and Concatenation 

In the recomposition translator, when an 
operand is not an intermediate temporary, it is 
developed as an element in the output string 
and placed in an empty word in a pool. It is 
treated as a one-element string and assigned a 
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X = ((-B) + SQRTF(B**2-4. *A*C))/(2.*A) O 

MACRO OUTPUT STRING PUSH DOWN LISTS 


OP 

VI 

V2 

OP 

Var 

um 

B 


= 

X 




( 

X 




Jr 

temp 




jirtC 

SQRTF 


( 

fo 


Figure 2.10a. Example of Decomposition — Part I. 

control word. When strings are to be joined 
together, the last word of the first string refers 
to the linking operator (which is developed in 
the empty pool), and in turn, the operator 
refers to the first word of the next or preceding 
string. The two or more control words are 
combined into one which references the first 
and last words of the concatenated chain or 
string of elements. When the recomposition 
translator eventually encounters the EOM op- 
erator, there is only one chain represented by a 
control word on the string list. This chain 
or scrambled string is then unraveled into a 
sequential list of all the elements in the re- 
composed statement or expression. 


vii. 

viii . 

ix. 


* xi. 

* xii. 

* xiii. 
xiv. 

XV. 

xvi- 

xvii. 

xviii. 

* xix. 

XX. 


Mixed Mode 

Mixed Mode in a Function Argument 

Illegal Use of Function or Array Name Without Arguments 

Simple Variable, Constant, or Expression followed by left Parentheses 

liiegol Mode of Function Argument 

Illegal Number of Arguments in Function 

Fixed to Float Exponent 

Level of Nesting of Functions Exceeds Maximum Number of Eight (8) 

Illegal Successive Operators 

lllegol Parenthetical Order 

Uneven Number of Parentheses 

General Syntax Error 

Expression Begins with Illegal Operator 

Mode of Variable Not Set 

Mode Not Set For Any Arguments in Function 

Number of Parameters in Function Exceeds Declared Maximum 

Number of Arguments in (Defined) Function Specified to be Zero 

Mode of Actual Argument is Not Set 

Illegal Operator in Parameter or Illegal Position for Comma 
A**B**C condition - (illegal in FORTRAN) 


Only these errors cause an immediate error return. 
All others return for ftjrther error checkim 


ul others return for lurther error checking. 

Figure 2.9. Arithmetic Translator Diagnostics. 


Diagnostics 

The decomposition performs complete diag- 
nostic checking. Wherever possible, error 
checking continues even though some errors 
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Figure 2.10b. Example of Decomposition — Part II. 
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I 

X = ((-B) + SQRTF(B**2-4.*A*C))/(2.*A) O 

MACRO OUTPUT STRING PUSH DOWN LISTS 


OP 

V7 

V2 

OP 

Var. 

um 

B 


= 

X 

♦* 

6 

2 



* 

4. 

A 



* 

temp 

C 


Jprtfp 

- 

temp 

temp 




temp 




fo 

SQRTF 




+ 

temp 

temp 


tpnfp 

temp 


Figure 2.10c. Example of Decomposition — Part III. 

have already been encountered. (See Figure 

2.9 for a list of decomposition diagnostics.) 

Decomposition Rules (Figure 2.10) 

1. When all action has been taken with a new 
operator or variable encountered in the 
forward scan, it is placed on the appropri- 
ate push-down list. 

2. An array name or function name followed 
by a left parenthesis generates two addi- 
tional operators for the operator list: a 
subscript operator or function operator, 
and a comma operator for the* initial pa- 
rameter. 

3. When the forcing value of a new operator 
equals or exceeds the forcing value of the 
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Figure 2.10d. Example of Decomposition — Part IV. 


last operator on the operator list, action is 
taken to output a two- or three-byte macro : 

a. The last operator on the operator list 
and the last one or two variables on the 
variable list, depending on the opera- 
tor, are removed and incorporated into 
an output macro. 

b. For each macro generated for the out- 
put string, except for comma-operator 
macros, a temporary indication is gen- 
erated on the variable list. 

X = ((-B) + SQRTF(B**2-4.‘A*C))/(2.*A) O 

MACRO tNPUT STRING PUSH DOWN LISTS 


OP 
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V2 

OP Work String 

um 
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*• 

B 

2 
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* 

4. 

A 



temp 

C 

X 

- 

temp 
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X 


temp 


C 

fo 

SQRTF 


4.*A 

- 

temp 

temp 


• 

2. 

A 


/ 

ten^ 

temp 


= 

X 

temp 


O 

temp 



Current 



New Op 




Figure 2.11a. Example of Recomposition — Part I. 

Recomposition Rules (Figure 2.11) 

1. On each new macro encountered in the for- 
ward scan, the right operand (V2), if it 
exists, is always considered for action be- 
fore the left operand ( VI) . 

a. If Vi of a macro is not a temporary 
indication, it is developed in a word 
from the empty pool, and assigned to a 
control word which is placed on the 
intermediate work list. 

b. If Vi of a macro is a temporary indica- 
tion, the last control word on the string 
list is removed and placed on the work 
list. 

2. For any operator in a macro except the 
comma operator, the last action taken is to 
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Figure 2.11d. Example of Recomposition — Part IV. 


Figure 2.11b. Example of Recomposition — Part II. 

place the operator temporarily on the op- 
erator push-down list, and to combine the 
control words on the work list, linking 

cs'fvinrra 

ls:iU^XX4.^0 wrgV' VXXV/X XXXUV/ V/VfXJ. t/1. V./J, 

word, which is placed on the string list. 

a. For subscript and function operators, 
the name and parameter strings refer- 
enced on the push-down work list are 
linked in order, separated by appropri- 
ate parentheses and commas. 

X = {(-B) + SQRTF(B**2-4.*A*C))/(2.*A) O 
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Figure 2.11c. Example of Recomposition — Part III. 


b. For arithmetic unary or binary opera- 
tors, the one or two strings referenced 
on the work list are linked with the 
operator. 

3. Whenever a control word is removed from 
the string list, an operator is removed 
from the operator list and tested against 
the new operator from the current macro. 

a. When the right forcing-value of the 
new operator equals or exceeds the left 
forcing-value of this last operator from 
the operator list, parentheses are placed 
at the ends of the string referenced 
by the control word just placed on the 
work list. 

b. If the string represents V2, and the left 
forcing-values of the new and last op- 
erators are equivalent, parentheses are 
placed at the ends of the string. 

Extensions 

There is no limit to the length of the state- 
ment string that can be used as input to this 
type of decomposition translator. 

Any mathematical language based on hier- 
archical rules of operation — ^for purposes of 
computation similar to that in arithmetic 
formulas — can be decomposed and recomposed 
just as easily using forcing tables and the other 
traditional techniques. The macro form pro- 
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duced could be of quite a different form de- 
pending upon the nature of the interpretive 
scan. It would, of course, have the same implied 
order of operation. The operators involved 
need not be only unary or binary operators; 
they need not be only arithemtical or functional. 
Boolean operators, logical operators, ternary 
operators, or any others could be easily handled 
in this manner. 
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INTRODUCTION 

The multicomputer programming techniques 
discussed in this paper were conceived and im- 
plemented in a large scale tactical data system 
developed for the U.S. Navy. Many of the de- 
tails of this system are classified and cannot be 
discussed here.* However, it can be stated that 
the subject system is a man machine complex, 
primarily intended for fleet air defense and 
surface operations and maneuvering. The ob- 
jectives of the system are: to provide com- 
manders of forces afloat with a broad picture 
of the current tactical situation; to assist in 
directing operations in time to intercept and 
destroy potential threats; and to present the 
means to coordinate various weapon systems 
in a combat environment. These are achieved 
by automating, to a high degree, the collecting, 
processing, exchanging, and evaluating of large 
quantities of data through use of computers 
and digital data processing techniques. 

The system objectives are accomplished in 
real-time. Data are received by the system 
from various sensing devices such as shipboard 
and AEW radars, sonar, IFF equipment and 
ECM equipment which are in continuous con- 
tact with the outside environment. Data enter- 
ing the system are processed, analyzed and used 
by the system to influence or alter an event dur- 
ing the progress of that event, i.e., real-time. 


"This paper was orifirinally prepared for presentation 
at the 19()1 Eastern Joint Computer Conference. 
Security regulations prevented its release until now. 


Fundamental in the design philosophy of this 
system is the Unit Computer concept. In es- 
sence, the concept is a recognition of the need 
for computers of varying capability among 
ships of various types and operating modes. It 
solves this requirement by the use of standard 
computers operating in multiples to obtain in- 
creased capacity and functional capability, 
rather than use of several different computers 
each possibly of a different shape and size. 
However, in solving this varying computer 
capacity requirement, the Unit Computer con- 
cept established the need for multicomputer 
programs. 

This paper concerns the programming prob- 
lems encountered in designing and implement- 
ing operational programs requiring more than 
one Unit Computer and describes the techniques 
developed to solve two intrinsic problem areas : 

• EXECUTIVE CONTROL IN A MULTI- 
COMPUTER COMPLEX 

• DATA TRANSFER BETWEEN COM- 
PUTERS 

A third problem is assignment and distribu- 
tion of tasks between computers. To deal with 
this subject in the detail required would neces- 
sitate discussing classified material. That this 
is a problem of complexity should be obvious. 
However, it is evident that distribution of tasks 
between computers is a system design problem 
which must be uniquely solved for each multi- 
computer system. 
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It should be emphasized that the multicom- 
puter programming techniques discussed in this 
paper have been tried and tested for the last 
five years. Several multicomputer programs, 
employing two and three computers, have been 
delivered to the Navy. Thus, multicomputer 
programming, as discussed in this paper, is not 
theory, but actuality. 

In preparation for the subject of multicom- 
puter programming, it is first necessary to dis- 
cuss pertinent characteristics of the Unit Com- 
puter developed by UNI VAC to satisfy the Unit 
Computer design philosophy. 

I. THE UNIT COMPUTER 

The Unit Computer is a general-purpose, 
stored program, solid-state machine.* Only 
those characteristics most pertinent to the sub- 
ject of multicomputer programming are dis- 
cussed in this paper. 

Real-time Clock 

Among the features that make possible the 
use of the computer in real-time applications is 
the real-time clock. The clock is contained in a 
special register within Magnetic Core Memory 
and can be referenced or set by the computer 
program. Time is maintained accurate to the 
nearest 2-^^ second. The clock is incremented 
automatically by the computer upon signal of a 
1/1024 cps crystal controlled oscillator. 

When the system is started, the clock is set to 
zero and time is automatically maintained rela- 
tive to the start of the system problem. Al- 
though the modulus of the clock is approxi- 


mately 7 days, the real-time modulus of the 
system may be somewhat less. The real-time 
modulus of the system is so defined that in 
scheduling tasks for execution (see discussion 
under Section II — Program Control in a Multi- 
computer Complex), the modulus of the clock 
will not be exceeded. 

In a multicomputer program, the real-time 
clocks within all computers are synchronized. 
Upon initiation one computer transmits the 
content of its real-time clock to the other com- 
puters, which then set their clocks accordingly. 
DATA TRANSFER LOGIC 

A total of 14 input and 14 output channels is 
provided in the Unit Computer. Of these, two 
input and two output channels are especially 
designed for intercomputer communications. 
Each input channel and output channel consists 
of 30 data lines and 3 control lines. The special 
input/output channels differ from a normal 
input/output channel primarily in use of the 
control lines and in the associated channel logic. 

The input/output logic of the Unit Computer 
requires that peripheral equipment request each 
input or output data word transfer. The com- 
puter then acknowledges receipt of the input or 
availability of an output. All input channels, 
including the special input channels for inter- 
computer transfer, and all normal output chan- 
nels use Request and Acknowledge logic. 

The control lines associated with request and 
acknowledge logic are shown in Figure 1. Thes6 
are Input Data Request Line and Input Ac- 
knowledge Line for normal input channels and 



Figure 1. Normal Channel Configuration. 


*A more thorough description can be found in “Data- 
mation,’* December 1961, p. 45. 
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Output Data Request Line and Output Ac- 
knowledge Line for normal output channels. In 
addition to these two lines normal input chan- 
nels have an External Interrupt control line and 
normal output channels have an External Func- 
tion control line. The External Interrupt con- 
trol line allows the external equipment con- 
nected to that channel to signal the computer 
of its immediate data transfer requirement 
which is honored by an interruption of the com- 
puter program. The External Function control 
line is used to specify a function desired of the 
external equipment connected to that output 
channel. An External Function Word to a tape 
control unit, for example, may specify Rewind 
Tape Unit 2, 

The two special output channels use Ready 
and Resume logic for intercomputer data trans- 
fer control. The transmitting computer signals 
that it is Ready with an output data word which 
is interpreted by the receiving computer as an 
Input Data Request signal. The receiving com- 
puter upon accepting the transfer will send 
back an Input Acknowledge signal which is in- 
terpreted by the transmitting computer as a 
Resume. This reverse logic is necessary so that 
the transmitting computer will exhibit to the 
receiving computer the same control and timing 
characteristics as do the peripheral equipment. 

The control lines associated with Ready and 
Resume logic of the intercomputer channels are 
illustrated in Figure 2. These are Input Data 
Request (or Ready) and Input Acknowledge 
(or Resume). In addition, intercomputer chan- 
nels utilize an Input Buffer Status signal. The 
Input Buffer Status signal originates in the 
receiving computer and is defined by the state 
of the Input Buffer Active/Inactive Designa- 
tor. The transmitting computer can sense the 
Input Buffer Status signal, thus determining 
status of the receiving computer’s input buffer 
on that intercomputer channel. 

A special output register (Cl), incorporated 
in the computer, alternately services the two 
output channels reserved for intercomputer 
communication. All output intercomputer com- 
munications are time-shared through this spe- 
cial output register which holds data until a 
Resume signal is received from the receiving 


computer. However, if a Resume signal is not 
received within 32 to 64 seconds, the register 
is automatically cleared and the computer is 
notified of this condition by an Internal Inter- 
rupt called the Intercomputer Failure Monitor. 
This 32- to 64-second time interval is necessary 
to insure that intercomputer output with other 
computers is not suspended indefinitely. The 
Interrupt utilizes special entrance registers in 
Magnetic Core Memory for notification of 
faulty intercomputer output channels and to 
distinguish which of the channels is faulty. 

Data transfer in or out of the Unit Computer, 
whether between computers or between com- 
puter and external equipment, is handled by 
buffered transmission of data with timing un- 
der control of the receiving computer or the 
external equipment respectively. The buffering 
process transfers consecutive words, starting 
at a given initial address through a given ter- 
minal address, on a specified input or output 
channel. A single computer instruction will 
initiate a buffer mode of data transfer. Once 
established, buffer transmission employes inde- 
pendent access to memory ; the entire buffering 
operation proceeds to completion with no addi- 
tional program references. Thus, the buffer 
mode of data exchange provides for input/out- 
put operating asynchronously with the main 
computer program ; the computer continues 
execution of program instructions in the nor- 
mal sequence. 

The sequence of data and control signals for 
a normal transfer of data from one computer 
to another would proceed as follows : 

a) Receiving computer sets Input Buffer Ac- 
tive Signal ; 

b) Transmitting computer detects Input 
Buffer Active Signal; 

c) Transmitting computer places data on 30 
data lines ; 

d) Transmitting computer sets Ready which 
becomes Input Data Request in Receiving com- 
puter ; 

e) Receiving computer detects Input Data 
Request ; 

f ) Receiving computer samples 30 data lines ; 

g) Receiving computer sets Input Acknowl- 
edge line (returned to Transmitting computer 
as Resume) ; 
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h) Transmitting computer senses Resume 
line ; 

i) Transmitting computer drops data lines 
and Ready line. 

Steps c) through i) of this sequence are re- 
peated for every data word. Input Buffer Ac- 
tive remains energized during the entire trans- 
fer of a block of words. 

In most cases it is desirable for the program 
to be informed of the completion of a particular 
buffer transmission. This is made possible by 
internal interrupts generated within the com- 
puter. If an input buffer mode via a specified 
channel was established with a buffer monitor, 
an Input Buffer Monitor Interrupt is generated 
wheh the buffer mode terminates (i.e., trans- 
mission has been completed a^s indicated by the 
current buffer address equal to the terminal 
address). Likewise, an output buffer estab- 
lished with monitor would generate an Output 
Buffer Monitor Interrupt when the buffer mode 
terminates. A Buffer Monitor Interrupt is as- 
sociated with each input channel and each out- 
put channel; a separate entrance register in 
computer memory is reserved for each Inter- 
rupt. The unique entrance address thus defines 
the source of the Internal Interrupt request for 
action by the appropriate Interrupt routine. 

COMPUTER INSTRUCTIONS 

In addition to instructions normal to arith- 
metic, logic, and index operations there are 
several instructions which add tremendously to 
the power of the computer in a real-time appli- 
cation. Those instructions which are mentioned 
or inferred later in the paper are now described. 

Repeat — The Repeat instruction sets up a 
repeat mode whereby the instruction immedi- 
ately following the Repeat instruction is exe- 
cuted n times, as specified in the Repeat instruc- 
tion. In initiating the repeat mode, n is trans- 
ferred to a special register which retains the 
number of executions remaining throughout the 
repeat mode. If n is zero, the instruction to be 
repeated is skipped. This instruction also en- 
ables automatic address modification (incre- 
ment, decrement, or increased by a pre-set con- 
stant) of the repeated instruction after each 
individual execution. 


Ordinarily the repeated instruction searches 
a table for coincidence. If coincidence does not 
occur, the repeat mode terminates and the in- 
struction following the repeated instruction is 
executed. If coincidence occurs, the repeat mode 
terminates and the instruction following the 
repeated instruction is skipped. This method of 
searching a table for coincidence is used by the 
Executive Routine in controlling the execution 
of various tasks of the main program, as dis- 
cussed later. A principal advantage of a re- 
peated search is the reduction in instruction ex- 
ecution time since a memory reference is not 
required to read in a next instruction. As used 
in the Executive Routine, a repeat search takes 
8 + 11.2 (N) microseconds where N is the table 
item number on which coincidence occurs. A 
non-repeated search to accomplish the same end 
would require 8 + 40 (N) microseconds. Thus, 
the time required for program control is re- 
duced by an approximate factor of 3 through 
use of the Repeat instruction. 

Compare — This instruction compares the 
signed value of the operand with the signed 
value contained in either or both the arithmetic 
registers, A and Q. Skipping the program's 
next sequential instruction is also allowed by 
this instruction if the Skip condition is met. 

The Compare instruction is used with the 
Repeat instruction to perform the search de- 
scribed previously. The execution time given 
in that description is for a repeated Compare. 
As will be described later in Section II of this 
paper, the time at which a task is scheduled to 
be executed is compared with the contents of 
the real-time clock. 

Return Jump — The Return Jump instruction 
provides for transferring control to a specific 
set of instructions (subroutine), executing that 
subroutine, and returning control to the next 
instruction following the Return Jump. The 
Return Jump makes possible the modular pro- 
gram construction used whereby each sepa- 
rately defined task is implemented by a subpro- 
gram, which is referenced by the Executive 
Routine, and the identical subprogram is used 
in each program performing that task. A sub- 
program is a high-level subroutine; a subpro- 
gram usually references lower-level subrou- 
tines. 
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External Function — The External Function 
instruction has two meanings, one for inter- 
computer channels and another for normal 
input/output channels. If the instruction spec- 
ifies an intercomputer channel, the connected 
computer is interrogated as to the status of its 
input buffer on the connected channel. If the 
interconnected computer's input buffer is ac- 
tive (i.e., an input buffer has been initiated and 
the connected computer is ready to receive 
data), the next instruction following the Ex- 
ternal Function is skipped. If the connected 
computer’s input buffer is not active, the next 
instruction (which is normally a Jump) is exe- 
cuted. 

For normal input/output J^hannels, the oper- 
and of the External Function instruction (nor- 
mally a control code) is transmitted to the ex- 
ternal equipment connected to that channel. 

Store Input Channel — -This instruction stores 
the contents of the specified input channel at 
the address specified by the operand. An Input 
Acknowledge signal is then sent over the speci- 
fied channel thereby informing the external 
equipment of the computer’s availability to re- 
ceive additional data and to indicate that the 
previous input was received. Since buffer mode 
of data transfer is normally used, the Store In- 
put Channel instruction is used primarily to 
generate Input Acknowledge signals to Ex- 
ternal Interrupts. 

Initiate Input Buffer (With Monitor) and 
Initiate Output Buffer (With Monitor) — These 
instructions establish an input or output buffer, 
respectively, via the specified input (or output) 
channel. Subsequent transfers of data, executed 
at a rate determined by the external device, are 
made directly into (or occur directly from) 
Magnetic Core Storage starting at the address 
specified by the operand. The storage address 
initially established is advanced by one for each 
individual transfer. The next current address 
is maintained throughout the buffer process in 
the lower half word of the Buffer Control Regis- 
ter for the specified channel ; the last address of 
the transfer is maintained in the upper half 
word of the same Buffer Control Register. Each 
input channel and each output channel has a 
unique Buffer Control Register associated with 
it. The buffer mode will continue until it is 
superseded by subsequent initiation of a new 


buffer via the same channel or until the upper 
and lower half words of the Buffer Control 
Register are equal. Should the latter occur, a 
Buffer Monitor Interrupt is generated, causing 
the computer program to be interrupted and 
the Buffer Monitor Interrupt routine for that 
channel to be executed. 

Multicomputer programs use the Initiate In- 
put Buffer (With Monitor) and Initiate Out- 
put Buffer (With Monitor) instructions exclu- 
sively in transmitting data between computers. 

II. EXECUTIVE CONTROL PHILOSOPHY 

The system which made necessary multicom- 
puter programming techniques has a number 
of functions that must be performed. In ac- 
complishing these functions, the design objec- 
tive is to use both men and machines to best 
advantage. Repetitive and routine operations 
are performed automatically by machines, 
whereas decisions of tactical importance are 
made by men. Although the focal point of the 
equipment complex is one or more computers, 
the system also employs high-speed digital data 
communication facilities, radar video proces- 
sors, analog-to-digital data converters, digital- 
to-analog converters, and typical computer pe- 
ripheral equipment such as magnetic tape and 
Teletype,* all of which provide for automatic 
inputs to the computer and/or automatic out- 
puts from the computer. A complete display of 
complex and data entry devices allow men to 
monitor an ever-changing, yet accurate picture 
of the current tactical situation and to enter 
their intelligence and tactical decisions into the 
computer. 

All equipment is connected directly or in- 
directly to the computers. For each function 
the computers must : 

• receive data from the appropriate system 
equipment, consistent with data transfer 
rates of each equipment ; 

• correlate, process and (as necessary) eval- 
uate data in performing the specified sys- 
tem function ; 

• present the results of processing and eval- 
uation to men for their interpretation and 
decisions ; 

• Teletype is a trade mark of the Teletype Corpora- 
tion. 
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transmit pertinent data and human deci- 
sions, as appropriate, regarding that data 
to the ultimate users, again consistent 
with data transfer rates of the equipment 
involved. 

The system operates in real-time, therefore, 
operational requirements dictate the response 
times for individual system functions. 

To relate a real-time system to real-time data 
processing then, one might say that real-time 
data processing is processing done immediately 
as a result of an input for the purpose of pro- 
viding an essentially instantaneous response or 
output.* Processing must, as a result, be com- 
pleted at a rate greater than the input data rate 
or else the processing will fall behind so that 
eventually the system will be operating out of 
real-time. The system saturation point is 
reached when processing rate equals the input 
rate. 

From the foregoing discussion, it is clear that 
any real-time system must have, at least, a data 
processing capability sufficient to handle the 
average input rate from all input functions. 
Moreover, since system inputs occur randomly, 
it must be assumed that all inputs could occur 
simultaneously. This would represent the peak 
load. The problem then, is how to schedule, or 
queue, the functions to be performed in such a 
way that under peak load conditions all func- 
tions would be performed within their real-time 
requirements. 

An Executive Control Philosophy has been 
developed and implemented to solve the schedul- 
ing (or multiprogramming) problem. Execu- 
tive Control Philosophy is a general term — ex- 
plained specifically it means a recognition of the 
system’s tasks in order of system priority ; dis- 
tributing them among the system’s computers ; 
and then controlling them in the individual 
computers of the system by an Executive Rou- 
tine within each computer. 

The Executive Control Philosophy is based 
on the fact that each function and concomitant 
handling of peripheral equipment defines a 

'"^DiscussiOns of the concept of real-time can be found 
in “Computers and Automation” — September, 1963, p. 
26, and “Datamation”May and June, 1963, pp. 29 and 
28 respectively. 


separate task (or set of tasks) that must be 
performed by the computers. 

Each system task is performed by a computer 
subroutine (or group of routines) called a ‘‘sub- 
program.” By definition, a subprogram is any 
subroutine referenced directly by the Executive 
Routine. 

An Executive Routine is contained in each 
computer of a multicomputer program and 
maintains complete control of tasks assigned to 
that computer. Over-all control of a multicom- 
puter program by a Master Executive Routine 
is not employed. Rather a method of reciprocal 
control is used. One of the system tasks (i.e., 
subprograms) within each computer in a multi- 
computer system is concerned with intercom- 
puter data transfer. The function of each inter- 
computer subprogram is to process the data re- 
ceived from an interconnected computer and 
upon completion of this processing to initiate 
the return data transfer. To effect reciprocal 
control, program logic must be such that the 
residual state of intercomputer transfer with- 
in each computer is input. Thus, the computer 
which has control of the intercomputer data 
transfer (i.e., next to initiate an output buffer 
transfer) actually has temporary control of the 
two-computer operation. Reciprocal control, 
therefore, means that control is assumed by one, 
then the other, of two interconnected comput- 
ers. In systems utilizing three or more com- 
puters, reciprocal control would be used for 
each pair of interconnected computers. Details 
of intercomputer data transfer are discussed in 
Section III. 

Subprograms within a computer are con- 
sidered for execution on the basis of their pri- 
orities as system components. System priority 
for subprograms is relative, and is determined 
in the following manner: If tasks correspond- 
ing to subprograms A and B have response re- 
quirements of 100-milliseconds and one second, 
respectively, then subprogram A would have a 
higher system priority than subprogram B. 
System priorities for all other subprograms (or 
groups of subprograms associated with one sys- 
tem function) are chosen similarly. 

Normally, tasks associated with processing 
inputs to the computer from system equipment 
(and outputs where stringent timing require- 
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ments are imposed by external equipment) have 
highest priority. This is particularly true if 
failure to accept inputs when available means 
loss of important, non-repeated data and pos- 
sible system malfunction. However, in some 
cases where inputs are under more positive con- 
trol of the computer or of a repetitive nature it 
is possible to lower the priority of equipment 
handling subprograms to insure processing of 
previously received data before new inputs are 
accepted. This has the advantage of effectively 
shutting off or delaying inputs during brief 
periods of saturation. Consider for example 
two subprograms associated with the display 
function: one subprogram periodically inter- 
rogates displays for manual action requests and 
the other processes these requests and gener- 
ates the proper response. By assigning the 
processing subprogram a higher system pri- 
ority than the interrogation subprogram, the 
rate of processing will automatically control 
the interrogation rate. Only under extreme 
saturation for a prolonged period would the 
operator even notice the delay; thus under nor- 
mal operation the system response time for the 
display function would be honored. 

For the reasons given above — to protect 
against loss of important non-repeated data and 
to insure processing of previously received data 
within the real-time response requirements — 
intercomputer subprograms are assigned a 
relatively high priority. Conversely, each com- 
puter might contain a subprogram of lowest 
priority to perform such operations as pro- 
gram checking when no other system task 
required execution. 

Consider a list of all subprograms ordered 
by priority. The Executive Control Philosophy, 
applied in an elementary form, would dictate 
that this list be scanned sequentially, executing 
subprograms when they are needed. After each 
subprogram execution, the scanning process 
would be resumed starting with the highest 
priority entry. Thus, under a peak load condi- 
tion where all subprograms may require execu- 
tion, the top priority subprogram would be 
executed instantaneously, the next highest im- 
mediately following, and so on. The resultant 
response time of each subprogram would be the 
sum of previous execution times for all higher 
priority subprograms, recognizing that some 


of the higher priority subprograms may be 
executed more than once. The assumption must 
be made that sufficient computational capa- 
bility exists to perforin all assigned tasks. If 
this is not true then complete penetration of 
the list will not be realized and the only solu- 
tion is additional computers. If computational 
capability is sufficient to handle the average 
input rate for all functions, then under peak 
load conditions higher priority tasks are per- 
formed first and lower priority tasks are de- 
layed until time is available. Thus, a lower 
priority task may be defined as not having a 
critical response time so that late execution 
will not seriously degrade the system. An ex- 
ample of such a task is output to displays which 
might need to occur at a rate of 15 times a 
second to maintain a flicker-free presentation. 
However, during peak load, which probably 
would last for less than a second, reducing the 
output rate to 14, 13, or less would normally go 
unnoticed. 

As stated earlier, the real-time requirements 
of any system task are such that processing is 
done within the required response time. The 
Executive Control Philosophy takes advantage 
of this characteristic by providing the option 
of assigning a variable priority to a subpro- 
gram corresponding to a particular task. Con- 
sider again the subprogram corresponding to 
task A. Subprogram A could be executed upon 
demand as a top priority item, thereby com- 
pletely satisfying the response requirements. 
But recall that the maximum response time 
tolerable was 100 milliseconds. A response 
from subprogram A would therefore be accept- 
able as late as 100 milliseconds after the origi- 
nal demand. Since the present computer load 
may be quite high, it might be desirable to de- 
fer execution of subprogram A for, say, 50 to 
75 milliseconds. This is done in the same pri- 
ority list described earlier, except that a time 
element is included in the ordering of the sub- 
programs. Multiple entries for subprogram A 
can be made in the priority list as follows : An 
entry is made as low priority item for execu- 
tion immediately, as an intermediate priority 
item to be executed in 50 milliseconds, and as 
a top priority entry to be executed in 75 milli- 
seconds. Thus, after a processing demand has 
been received, subprogram A will be executed: 
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a) immediately out of the low priority entry, 
if the computer load is low; 

b) out of the intermediate priority entry in 
50 milliseconds, if the computer load is moder- 
ate; and 

c) out of the top priority entry in 75 milli- 
seconds, under peak load conditions. 

This assures that under all load conditions 
subprogram A will be performed within the 
required response interval. It should be noted 
that the above procedure will result in auto- 
matic ‘'smoothing'' of the processing load, and 
will still allow execution of each subprogram 
in the absolute minimum time, consistent with 
system priority and existing computational 
load. A subprogram is executed only when its 
“flag" is set. By definition, a flag is set when 
the time at which the task is scheduled to be 
executed is equal to real-time. 


2) When the Executive Routine detects that 
a flag is set, it reschedules the time when the 
next flag for this subprogram will be auto- 
matically set. This mechanism allows for tasks 
that must be executed at an approximate peri- 
odic rate, as mentioned earlier for output rate 
to displays. 

3) One subprogram may set or partially set 
the flag of another subprogram. This mecha- 
nism is used when it is either necessary or 
desirable to execute certain associated subpro- 
grams in a definite sequence. For example, a 
subprogram which processes data placed in 
.computer stores would be flagged by another 
subprogram which acquires that data through 
interrogation of an external equipment. Ex- 
tending this same example by adding an addi- 
tional processing subprogram, it may be neces- 
sary to complete both processing subprograms 
before reinitiating the interrogation subpro- 
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Figure 2. Special Channel Configuration. 


Four mechanisms are provided to set these 
flags. Each subprogram normally uses but one 
of these mechanisms. 

1) External Interrupts or Internal Buffer 
Monitor Interrupts set the flags of subpro- 
grams associated with servicing external equip- 
ment and most data transfers. Consider the 
task of transmitting data over a communica- 
tion link. Communication terminal equipment 
would send an External Interrupt to the com- 
puter thereby signaling the start of a trans- 
mission period. The computer must have data 
to be transmitted available to the communica- 
tion equipment within a certain time period. 
If the communication link is not operating, 
there is no need to execute the communication 
transmit subprogram. 


gram. Since interrogation is periodic it would 
automatically be rescheduled by mechanism 2, 
above. However, a lockout (significant time 
period exceeding the real-time modulus of the 
system) could be added to the periodic interval; 
each processing subprogram when completed 
would partially set the flag by subtracting part 
of the lockout time. Thus, when both subpro- 
grams have been executed, the lockout is re- 
moved and interrogation will be repeated as 
per the periodic interval. 

4) A particular subprogram may reset its 
own flag if, after receiving control, it deter- 
mines that more than an allowable amount of 
processing time is required to complete the 
task. This mechanism is utilized only by lower 
priority subprograms and is designed to guar- 
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NOTE: 1. 

Subprograms D, E, F, J, and K are of a periodic nature. 


2. 

Entries in the ETT are based on a 24 hour real-time modulus of the system. 

3. 

The 24 hour entry in ETT represents a very large delay. 

This makes sure the flag is cleared 


automatically after the first reference. 



Figure 3. Example of Executive Table Structure. 
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antee meeting the more stringent response time 
requirements of higher priority subprograms. 
When the Executive Routine again considers 
this task for execution on the basis of priority, 
control will be returned to enable further proc- 
essing. The cycle will continue until all proc- 
essing of this task is completed. 

The Executive Control Philosophy is imple- 
mented through a unique method of table con- 
trol. Three tables — Executive Flag Table 
(EFT), Executive Time Table (ETT), and 
Executive Jump Table (EJT) — contain entries 
for each assigned subprogram which are or- 
dered by system priority. Each computer in a 
multicomputer program would have its own 
set of three tables with the subprogram entries 
unique to that computer. Figure 3 illustrates 
the one-to-one correspondence between the 
tables and also illustratesjh^se of a variable 
priority and lockout technique discussed earlier. 

For a given subprogram, (n), EFT„ contains 
the time (per real-time clock) at which the 
subprogram is to be executed, ETT^ the auto- 
matic delay before the subprogram is to be 
repeated, and EJTn the address of the subpro- 
gram. If the subprogram is to be executed 
at other than a periodic rate, the content of 
ETT„ is a sufficiently large number (usually 
the real-time modulus of the system) so that 
the task will not be repeated until one of the 
other flagging mechanisms resets the flag. In 
operation, the Executive Routine sequentially 
compares the computer's real-time clock against 
the entries in the Executive Flag Table, start- 
ing with the task of highest priority. In the 
case of a “hit" (the clock is greater than or 
equal to EFT„), the corresponding Executive 
Time Table entry is added to current time, the 
sum is stored in EFT, and control is trans- 
ferred to the corresponding subprogram as 
specified in EJT. 

If the “hit" corresponded to a multiple entry 
subprogram, all EFT entries corresponding to 
that subprogram would be reset. When control 
is returned to the Executive Routine, or if no 
hit occurs, the search of EFT is repeated, 
starting again with the task of highest priority. 
Figure 4 is a flow diagram of Executive Rou- 
tine logic. 


The description of Executive Routine opera- 
tion, as discussed in this paper, is short — but 
complete. Main control loop of the Executive 
Routine consists of eight instructions, one of 
which is the powerful Repeat instruction used 
in the search operation. The efficiency of a 
repeated Compare and the extremely small 
number of additional instructions necessary in- 
sure that only a minimum amount of computa- 
tional time is usurped for the important area 
of program control. 

III. DATA TRANSFER BETWEEN COM- 
PUTERS 

To accomplish intercomputer communica- 
tions, stringent control must be exercised in 
establishing and terminating intercomputer 
buffers in each computer, thus insuring that 
data transfer between interconnected com- 


V 



Figure 4. Executive Control Program. 
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puters will stay synchronized. Interlocks and 
check points for intercomputer communication 
are required in the hardware and in the pro- 
grams. Special characteristics of the computer 
are combined with the program to foment this 
multicomputer concept. 

As can be seen from the delineation of hard- 
ware cii^racteristics in Section I, significant 
and powerful means are available to produce 
an efficient program for intercomputer com- 
munication. Internal Interrupts provide ex- 
cellent program efficiency, in that they obviate 
periodic sampling requirements for detecting 
“end of buffer.’' Usually, within microseconds 
of buffer termination, the main program is in- 
terrupted to permit execution of program con- 
trol tasks required by buffer termination. 

In the following paragraphs, program charac- 
teristics and logic will be discussed to show the 
program’s use of the computer’s capability for 
data transfer. 

PROGRAM CHARACTERISTICS 

Elements of the logic governing the design 
of intercomputer programs are variable length 
hujfers and recip7'Qcal cGntroL Fundamental to 
obtaining variable length buffers are the 
formats— message and buffer — and the method 
of controlling buffers. Reciprocal control means 
that control of the buffer operation is assumed 
by one, then the other, of two interconnected 
computers. In accomplishing this “ping-pong” 
operation, program logic is such that the re- 
sidual state of the intercomputer channel is in 
the input mode. 


INTERCOMPUTER FORMATS 
Message Format 

There are two types of information contained 
in the message formats: control bits that indi- 
cate the type of format and the validity of the 
information, and data bits which are the actual 
parameters and operands being transferred 
(see Figure 5). Basic message formats used 
are: 

a) Two- word- — This format provides an effi- 
cient control-bit/data-bit compromise and is 
used for most types of transfer ; and 

b) Multi-word — This format allows for any 


data configuration that will not adapt to the 
two-word format. 

Both formats have a common control bit 
area in the lower 15 bits of the first word. 
These bits are used to: 1) specify exactly what 
action is required on the data contained in the 
format, and 2) detect bit errors in the inter- 
computer transfer. Note that if the format is 
specified as a two-word format, the next con- 
trol word can be located easily without any 
further decoding. If, however, the format is a 
multi-word format, the next control word must 
be located by knowing the number of data 
words (found in the upper 15 bits of the first 
word) contained in the format. Use of the 
entire control word reduces data transfer effi- 
ciency. Up to 45 bits of data can be contained 

CONTROL 
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Figure 5. Intercomputer Data Formats. 
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in the two-word format, whereas only 30 bits 
of data can be contained in the first two words 
of a multi-word format. 

Buffer Format 

The message formats are superimposed on 
the intercomputer buffer format in chronologi- 
cal order. The output buffer format* s first ivord 
contains the first and last address in ivhich the 
data are to be put in the receiving computer 
(see Figure 6) ; next follow all the message 
formats. The last word is a unique end code 
to help determine whether errors have been en- 
countered in control of the data transfer. This 
buffer format is thus variable in length, pro- 
viding an efficiency in transmission time. Two 
buffer areas are provided in each computer 
for each transmission and one buffer area is 
provided for each reception. Thus, when an out- 


put buffer is active out of one area, the program 
may be packing the next buffer. In the case 
of '‘receive,'' only one buffer area is required 
since (as will be discussed later) all processing 
of a previous "receive" buffer is completed be- 
fore another receive buffer will be initiated. 

BUFFER CONTROL 

A major control problem encountered by the 
program is synchronization of the buffers in 
the communicating computers. This control 
problem arises: 1) when the intercomputer 
data buffers are initiated, and 2) when the buf- 
fers terminate. Between initiation arid termina- 
tion, the buffers are maintained in synchroniza- 
tion by the data control signals. The method 
of initiating the data transfer is the main key 
to establishing synchronization between com- 


I I I 



Figure 6. Intercomputer Buffer Format. 
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puters, because the buffer termination charac- 
teristics are defined within this initiation 
process. 

The use of the Input Buffer Internal Inter- 
rupt by the receiving computer is predicated 
upon the receiving computer's knowledge of 
the length of the input buffer. The input buffer 
length must agree with the length of the out- 
put buffer from the transmitting computer. 
Therefore, with variable length buffers, the 
transmitting computer must define the input 
buffer length for the receiving computer. 

The input buffer of the receiving computer 
is programmed to enter the first buffer word 
received into its Input Buffer Control Register. 
This is accomplished by executing an Initiate 
Input Buffer (With Monitor) instruction for 
the proper channel into the Input Buffer Con- 
trol Register for the same channel. This in- 
struction sets the initial address (the lower or- 
der 15 bits of the Inpu^ Buffer Control Regis- 
ter) the same as the address of the Input Buffer 
Control ^^egister. The terminal address (the 
upper order 15 bits of the Input Buffer Control 
Register) must be different than the initial 
address for this method to work. Before the 
transmitting computer can initiate the output 
buffer, the receiving computer must have estab- 
lished an input buffer mode. The transmitting 
computer must determine this by testing the 
state of the Input Buffer Status Line with the 
External Function instruction. 

When the receiving computer is ready to 
receive data (i.e., when all the specified condi- 
tions are met), the transmitting computer 
establishes the Input Buffer Control Word for 
the receiving computer. The first word of the 
output buffer data is the Input Buffer Control 
Word for the receiving computer. The lower 
order 15 bits define the initial address for data 
storage and the upper order 15 bits define the 
terminating address for data storage. Thus, 
the buffer area is defined for the receiving 
computer. 

The output buffer mode is initiated by execu- 
tion of the Initiate Output Buffer (With Moni- 
tor) instruction. As soon as this instruction is 
executed the first word of the output buffer 
data (the Input Buffer Control Word) is trans- 
mitted to the receiving computer. Data trans- 


fer proceeds under control of the Ready and 
Resume signals for the transmitting computer, 
and the Request and Acknowledge signals for 
the receiving computer. As soon as the trans- 
fer is completed each computer will be inter- 
rupted (as the result of the buffer monitor), 
allowing each computer to assume the opposite 
role — transmit or receive. 

PROGRAM LOGIC 

The intercomputer program logic is suffi- 
ciently flexible to allow operating any combina- 
tion of multicomputer configurations. Three 
multicomputer configurations that have already 
been successfully operated are : 

1 ) A three-computer system, 

2) A two-computer system connected by one 
pair of cables, and 

3) A two-computer system connected by two 
pairs of cables. 

Refer to Figure 7 for illustration of these 
various configurations. 

Other possible configurations for a three- 
computer system (connecting computers B and 
C) and systems utilizing four or more com- 



xwo-comfm systim cone auNNa oratAnora 



TWO^OMPUIERSYSIEM CTWO CHANNa OTERATION) 


Figure 7. Multicomputer Complex. 
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puters could use the same program logic. A 
current restriction in the multicomputer sys- 
tem discussed here is that only two computers 
can be connected to any one computer.* 

The discussion of program logic which fol- 
lows will consider the simplest case — that of a 
two-computer system interconnected by one 
channel. Variations required for other configu- 
rations will be mentioned in the course of dis- 
cussions. 

Two subprograms are required for control 
in the two-computer configuration. These are 
A-B Intercomputer Control Subprogram 
(ABCON)^ locate in the A Computer and the 
B-A Intercomputer Control Subprogram, 
(BACON), located in the B Computer. 

In addition to the subprogram in each com- 
puter, there are three Internal Interrupt rou- 
tines in each computer. They are the Input 
Buffer Monitor, Output Buffer Monitor, and 
Intercomputer Failure Interrupt routines. The 
routines are named by suffixing the subpro- 
gram name with MIN, MOUT, and FAIL, re- 
spectively. For example, the set of interrupt 
routines in the A Computer for the ABCON 
subprogram is ABCONMIN, ABCONMOUT, 
and ABCONFAIL (see Figure 8). In general, 
one subprogram and the three associated In- 
ternal Interrupt routines are required by each 
computer for each channel of intercomputer 
operation. In the three-computer configuration 
shown in Figure 7, Computer A would contain 
two subprograms (ABCON and ACCON) 
whereas Computers B and C would each con- 
tain one (BACON and CACON, respectively). 
In the two-computer system utilizing two-chan- 
nel operation, each computer would contain 
two subprograms. 

The program logic for the subprogram and 
three interrupt routines is now discussed for 
the A-to-B data transfer. Other subprograms 
and Interrupt routines are identical except as 
noted. 

*Since this paper was written, three-computer pro- 
grams have been delivered connecting all computers to 
each other. 

^ The first letter in the code name indicates the 
computer in which the subprogram is located and the 
second letter indicates with which other computer the 
subprogram com.municates. 


ABCONFAIL ABCONMOUT 




ABCONMIN 



Figure 8. Interrupt Flow Diagram. 

The ABCONMIN program is initiated by 
termination of the intercomputer input buffer. 
In order to terminate the buffer the first word 
received must contain the proper input buffer 
limits as computed by the BACON subprogram. 
The ABCONMIN program checks the last word 
received (designated by the input buffer con- 
trol register) to make sure it is a valid end 
code, thereby preventing an error in the con- 
trol signals from causing erroneous data to be 
passed between computers. If the end code is 
correct, this program sets the ABCON flag in 
the Executive Flag Table. 

The Output Buffer Monitor Interrupt in the 
A Computer indicates that data exchange from 
A Computer to B Computer is complete. The 
A-B Output Monitor routines (ABCONMOUT) 
thus entered initiates an input buffer (with 
monitor), into the buffer control register as- 
suring that the residual state of the intercom- 
puter channel is in the input mode, so that an 
output of the B Computer is accepted imme- 
diately when initiated. 
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The Intercomputer Failure Interrupt routine 
(ABCONFAIL) is initiated when the data 
transfer to the B Computer has stopped for at 
least 32 seconds, but not more than 64 seconds 
(48-second average). This could occur if an 
input request to the B Computer were missed. 
The ABCONFAIL program alerts the A Com- 
puter program that a malfunction has occurred, 
allowing either automatic or manual remedial 
action to be performed, depending on the multi- 
computer configuration involved. For example, 
if both intercomputer channels are connected 
between the same two computers, entire data 
transfer is switched to the alternate channel. 
In any case, however, the proper information 
is given to allow rapid emergency maintenance 
whenever the faulty component is isolated with- 
in the system. 

The A-B Intercomputer Control subprogram 
(ABCON) establishes control for the exchange 
of data between the A and B Computers in a 
multicomputer installation. This subprogram 
(ABCON) is flagged initially by the Input 
Buffer Monitor program (ABCONMIN) as de- 
scribed above. It is also reflagged by the EXEC 
on a 1-millisecond basis until the buffer is com- 
pletely processed. The ABCON subprogram is 
referenced directly by the Executive routine 
in the A Computer. Operation of this subpro- 
gram is independent of other intercomputer 
subprograms. For example, ACCON would be 
contained in the A Computer in a three- 
computer complex. 

The content of the input buffer contains both 
control and data information. Only the control 
data are processed by the ABCON subprogram. 
The first word of each intercomputer message 
format contains the control information. 

The ABCON subprogram performs two basic 
functions : 1) processing of the data in the input 
buffer, and 2) control of the output buffer to 
the B Computer. The ABCON subprogram 
initiates output buffer control if all data have 
been processed from the input buffer. 

If all data have not been processed, the 
ABCON program (see Figure 9) checks to de- 
termine whether the control word is valid. If 
the control word is not valid an error has oc- 
curred in the data bits on that intercomputer 
channel. 



Fif^ure 9. A-B Intercomputer Control Subprogram 
(ABCON). 


The recovery action, as described in the 
ABCONFAIL routine, can be used with the 
difference that the discovery of error has been 
made in the receiving rather than in the trans- 
mitting computer. 

The next operation determines the data 
format type so that the data can be extracted 
from the control word (only for two-word 
format). Then the index to the next control 
word is computed. This index is stored to make 
the next control word available when processing 
the next format. The ABCON subprogram 
then turns control over to the processing sub- 
routine which is specified by the format desig- 
nator in the control word. This is one applica- 
tion of the indirect addressing feature of the 
Unit Computer. The processing subroutine 
executes its designated task and returns control 
to ABCON. ABCON returns control tempo- 
rarily to the Executive routine to determine 
whether any higher priority subprograms are 
scheduled. 

This is similar to flag setting mechanism 
number 4) described in Section II of this paper. 
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If stringent timing requirements for higher 
priority tasks do not exist, all intercomputer 
messages could be processed before returning 
control to the Executive routine. 

This cycle continues until all data formats 
have been processed. At this point the B Com- 
puter Input Buffer status is tested to determine 
whether the B Computer has established an 
input buffer. If not, control is again returned 
temporarily to the Executive routine. Nor- 
mally this test is passed without delay and re- 
quirements for initiating an output buffer to 
the B Computer are begun. The unpacking in- 
dex is reset for the next input buffer and the 
ABCON flag is cleared in the Executive Flag 
Table. The input buffer control word is packed 
in the first word of the current output buffer 
and the end code is packed as the last word. 
The output buffer control word is formed, the 
output buffer packing index is switched to the 
alternate buffer area, and an output buffer with 
monitor is initiated. Control is now returned 
to the EXEC until the next input buffer is 
received. 

ILLUSTRATION OF OPERATION 

In the discussion that follows, the method of 
control of the data exchange between the A 
and B Computers is described. The same proc- 
ess is used for controlling the A-C intercom- 
puter data exchange in a three-computer 
system. 

The computers are programmed to have equal 
control capabilities, but at any one time the 
computer that has initiated the Intercomputer 
Control subprogram is in control. The Inter- 
computer Control subprogram will retain con- 
trol until all previously received inputs have 
been processed. When all inputs are processed, 
the computer in control initiates an output to 
the other computer. Control is relinquished to 
the other computer as soon as all the output 
data have been transferred. This type of re- 
ciprocating control is described as a “ping- 
pong'' control system. 

At the outset of system operation, the A Com- 
puter is selected to have initial control. The 
A Computer is forced into the output mode by 
an initialization subroutine. Conversely, the B 
Computer is started in the input mode by its 


initialization subroutine. Data, transmitted 
from the A Computer to the B Computer in the 
first buffer exchange, would include the A Com- 
puter's real-time clock reading (in order to 
provide a common time base for the entire 
system) in addition to other initializing data. 

The ABCON subprogram in Computer A 
tests the special input buffer status control line 
that indicates when the input mode has been 
selected in Computer B. Computer B, via the 
BACONMOUT program or the initialization 
subroutine, must have its Input Buffer With 
Monitor in such a manner that the first word 
received from Computer A automatically de- 
fines the area of Computer B's memory in 
which the succeeding data words are to be 
placed. When the input mode is detected, the 
ABCON subprogram in Computer A initiates 
an Output Buffer With Monitor. The first word 
always contains the first and last address of 
the input area in Computer B. Data transfer 
proceeds under control of the regulating signals 
which, in the buffer mode, operate independ- 
ently of the program operation. The monitor 
logic in both computers compare, the current 
data word’s transfer address with the last 
address that should be transferred. When 
equal, each computer's monitor logic interrupts 
normal program operation to allow immediate 
action by the respective computers. 

Computer A, which was transmitting data, 
initiates the Output Buffer Monitor program 
(ABCONMOUT), which initiates an Input 
Buffer With Monitor into its buffer control 
register, and will remain inactive in the inter- 
computer transfer until Computer B initiates 
a transfer back to Computer A. In Computer 
B the monitor initiates the Input Buffer Moni- 
tor program (BACONMIN), which in turn 
initiates the BACON subprogram, thereby ac- 
cepting the intercomputer control function 
from Computer A. The BACON subprogram 
will initiate the processing of all data, using 
the Control bits found in the first word of each 
format, to select the proper processing subrou- 
tine that is required. When all data are proc- 
essed, an Output Buffer With Monitor is initi- 
ated to Computer A. This buffer consists of all 
output data accumulated while Computer B was 
in the input mode. 
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When this buffer from Computer B to Com- 
puter A is completed, the BACONMOUT pro- 
gram in Computer B initiates an input buffer 
to accept the next output buffer from Computer 
A. At the same time, the ABCONMIN pro- 
gram in Computer A flags the ABCON sub- 
program so that the inputs can be processed. 
When the inputs have been processed by 
ABCON, a complete round-trip intercomputer 
cycle is completed. The process repeats con- 
tinuously, lending to the ‘“ping-pong"' effect in 
data transfer. 

IV. CONCLUSIONS 

Other systems, both commercial and military, 
have been and are being conceived which re- 
quire more than one computer to solve the 
user's problem. Some of these systems will 
connect two or more identical computers 
whereas other systems will connect a large 
scale central processor with smaller satellite 
computers. Consideration of multicomputer 
concepts in an original system design would 
allow for expansion of a single computer in- 
stallation into a multicomputer installation as 
the problems to be solved increase or become 
more complex. Similar considerations would 
allow for greater flexibility in a multicomputer 
installation such that, depending on the problem 
or operational requirements, computers could 
operate separately on different problems or to- 
gether to solve a common problem. The multi- 
computer programming techniques presented 
in this paper are usable as described or are 
adaptable to fit a particular system require- 
ments. 

The Unit Computer was designed specifically 
to facilitate extremely efficient intercomputer 
data transfer. However, such features as in- 
ternal interrupts upon completion of a buffer 
transmission, although desirable, are not ab- 
solute requirements. The basic requirement is 
the ability to communicate directly with an- 
other computer. Since control signals associ- 
ated with input and output to another computer 
differ from those associated with input; output 
to a peripheral device, these differences must 
be compensated for in the computer hardware. 

The computer characteristic necessary to 
implement the real-time executive control 
philosophy is a real-time clock. This would be 


an internally stored, automatically maintainea 
clock as in the Unit Computer, or an external 
clock capable of being referenced by the com- 
puter. The internal clock, repeat and compare 
instructions and external and internal inter- 
rupts included in the Unit Computer enable 
extremely efficient executive control ; however, 
of these, only a real-time clock is an absolute 
requirement in the application of the Execu- 
tive Control Philosophy to the control of a real- 
time system. If real-time is not a system re- 
quirement the real-time clock could be replaced 
by some other indexing mechanism under con- 
trol of the program, thereby enabling use of 
the same priority logic to optimize executive 
control. 

The Executive Control Philosophy presented 
in this paper is not restricted to multicomputer 
applications. The concepts are equally valid for 
a real-time system employing only one com- 
puter. 

Several extensions of the techniques dis- 
cussed will, no doubt, be obvious to the reader. 
Among these are dynamic changes in system 
task priorities as the operational requirements 
change during program operation. This merely 
means altering the order of the task list in the 
Executive stores. Included in this extension 
would be the ability to delete some tasks and 
add new tasks (subprograms) from a supple- 
mentary storage. This could be useful not only 
in a real-time command and control system 
when the nature of the battle may have 
changed but also in the scheduling of multi- 
programming type problems in a commercial 
computing center. The fact that subprograms 
are designed to be independent of one another 
with any communication or control handled by 
means of the Executive routine and associated 
tables, makes the concept entirely feasible. 

From a hardware standpoint, one modifica- 
tion considered is the ability of one computer 
to interrupt another computer. This charac- 
teristic has been incorporated in the later 
models of the Unit Computer. It has the ad- 
vantage of allowing a computer to signal an- 
other computer that a buffer transmission is 
about to be initiated. This essentially elimi- 
nates the need for the Input Buffer Status sig- 
nal and allows for more positive control of 
intercomputer data transfer. 
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1. DEFINITIONS, SYMBOLS AND 
NOMENCLATURE 

1.1 Post Algebras 

Many papers about Post algebras may be 
found in the literature. In particular, Refer- 
ences [1] to [4] cover the theory here required. 

We shall designate the Postian variables with 
lower case letters a, b, etc. While most of the 
results next given are valid for N-valued Post 
algebras, we shall consider in regard to prac- 
tical applications a three-valued Post algebra, 
that is to say, Postian variables are permitted 
to take values on a three elements linear lattice 
(Ref. [5]) (Fig. 1). Postian functions of m 
variables are conceived as the elements of the 
m-dimensional hypercube generated by m of 
such lattices (Ref. [6]). 

t ^ 


11 


*0 


Figure 1. The Three-Elements Linear Lattice. 


1.2 Opej’ations 

The entire algebra may be generated by 
using the following two Primitive operations: 

Cycling a' 

Logical Product ab 

n 

n Sk = aoai ... a,, 
k=o 


Their truth-tables are : 



Other operations to be used are : 

Complementation "a = ( (aa')"(la)')' 

where a" means ( aQ' 

Logical Sum a-t-b = ( a b ) 
n 

U a^ = ao -(- ai a„ 

k=o 

“J” Operations Jo (a) = (aa")"a" 

Ji(a) = (a'a")"a' 

_ J 2 (a) = (aap "a 

“J” Operations Jk (a) = (^ (a) ) k e {0,1,2} 
“K” Operations K (a) = 1 J 2 (a) 
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Their truth-tables are easily obtained from 
their definitions in terms of the primitive opera- 
tions. 


1.3 Canonical Forms 

From the lattice concept of Postian functions 
described in 1.1 the following canonical forms 
are obvious : 


f(Xi,X2, . . . ,x„) = 

= Uf(ki,k2 k„) Jk (xi) Jk (x2)...Jk (x„) 

— n ^ (^l» • • • r kn) (Xl) -f- Jk (X2) . -j- Jk (X^) 


where ki, k 2 , . . . , k„ « {0,1,2}, and therefore the 
symbols U and fl stand for 3“ terms. 

For the sake of conciseness we shall use the 
notation 

(kik2...k„) = Jk (xi) Jk (x2)...Jk (x„) 

The symbol 0 instead of a k in the parentheses 
will imply that the corresponding J-operation in 
the second member must be omitted. 

1.4 Basic Logical Units Symbols 
In the following, logical diagrams will be 
often used. They are formed by interconnecting 
basic units, each one performing some definite 
logical operation. The symbols adopted for such 
units are listed in Fig. 2. 

2. ELECTRICAL MODELS FOR POST 
ALGEBRAS 

The logical values 0,1,2 will be associated to 
voltage levels according to Table 2. 



"And" Gate "Or" Sate 


- Jk 

— jK(a) a — " 

K 

Jk " Gate "K 

"Gate 


r s 


t 



Jo*} Jl*} J2*} 

"3“input" Tristable Multivibrator 


TABLE 2 


Logical value 

Associated voltage 
level (in volts) 

0 

+6 

1 

0 

2 

—6 


Given a certain Postian function f (xi, X 2 , . . ., 
x„), a circuit with n inputs (Vi, V 2 , . . . , V„) 
and one output V is said to be a Model of the 
function, if for any given combination of 
voltage levels at the input, the output assumes a 
voltage such that its relation to the set of in- 
puts is isomorphic with the relation between 
the function and its independent variables. 

As usual in the Synchronous type of Com- 
puters, those voltage levels are analyzed at cer- 
tain regular intervals of time (clock pulses). 

Postian variables may be related to other 
physical magnitudes of a system, such as cur- 
rents, magnetization states, etc., provided that 
they appear in three discriminable values. 
Those systems are also models of Postian func- 
tions in the general sense stated above. The 
problem of finding the Postian functions as- 
sociated with a given system is usually called 
“Logical analysis.” 

In Table 3, adopted correspondences between 
Postian variables and physical magnitudes are 
given: 


TABLE 3 


Logical 

value 

Current 

Magnetization 

0 

Positive sign 

Positive sign 

1 

No current 

Film demagnetized 

2 

Negative sign 

Negative sign 


Figure 2. Symbols Used for the Basic Logical Units. 
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3. LOGICAL ANALYSIS 

3.1 Introduction 

In the consideration of such tevTiary systems, 
we must distinguish between combinational and 
sequential systems. The meaning of such terms 
will be the usual one.^’'^ The analysis of both 
kinds of ternary systems is already known 
and may be easily performed. The correspond- 
ence between physical and logical values is first 
established. Next, the dependence of the output 
electrical values on the input electrical values 
is obtained (to carry out this step a definite 
knowledge about the physical behavior of the 
system is needed). The truth-table follows 
easily, and from it the canonical form of the 
Postian functions can be obtained. 

3.2 Application to the Analysis of a Ternary 
Store 

To illustrate the point, we shall consider the 
logical analysis of a magnetic thin-film ternary 
store. The design is based on the technique for 
magnetic thin-film binary storage described 
in Kef. [8] . 

A Ni-Fe alloy is deposited on a cylindrical 
glass tube by evaporation in the presence of a 
magnetic field. The resulting film exhibits a 
considerable anisotropy: an ‘'easy’' (rectan- 
gular loop) direction of magnetization is estab- 
lished in the film parallel to the applied mag- 
netic field, and a “hard” (linear characteristic) 
direction perpendicular to the previous one. 

In the following, we shall consider as relevant 



Figure 3. An Isolated Specimen of Magnetic Thin 
Film, showing its magnetic characteristic on the 
X and Y Directions. 


magnitudes: 1) Input currents, 2) Remanent 
magnetization states, 3) Voltage outputs. 

In Fig. 3 an isolated specimen of a thin- 
magnetic film is represented. 

There are two exciting wires, X, Y. Current 
flowing in each wire produces a field in the 
corresponding coordinate. The “easy” direction 
is circumferential, so remanent magnetic states 
can only occur in the X-direction. 

For suitable values of X and Y pulses, the 
following results were obtained : 

1) An X-pulse does not change significantly 
any previous saturation state of the film. 

2) A “nondestructive” intensity Y-pulse 
does not change significantly any previous 
saturation or demagnetized state. But if 
the pulse is increased to “destructive” 
intensity, the film always becomes damag- 
netized. 

3) An X-pulse in coincidence with a nonde- 
structive Y-pulse produces in every case 
a remanent magnetization state in a cir- 
cumferential direction, its sign depending 
on the X-pulse polarity. 

Word selection and nondestructive reading 


Tritl 
S Y2 


Tht2 Trit5 



Word 1 


Word 2 


Word 3 


Figure 4. Word Selection Array. Three Words of 
Three Bits Each, 
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would be therefore achieved with the array il- 
lustrated in Fig. 4. Every ring can store a 
“trit"’ (Log2 3 bits) of information. Noncoinci- 
dent Y-pulses are made nondestructive, whereas 
coincident Y-pulses generate a resulting MMF 
leading to ring demagnetization. 

The process of writing should be performed 
as follows: By the X, Yi wires of the selected 
word, current pulses are sent as indicated in 
Fig. 5. The writing of a 0, 1 or 2 depends on 
the temporal position of the aiding pulse Y2, as 
can be easily understood from inspection of the 
same figure. Note that the information is re- 
placed only in the films of the selected word. 


I Y 2 reading pulse In the selected 

word 

6 , output YOltaqc. AO stored 


output voltage. A 1 stored 


Figure 6. Reading Pulse and Outputs in the 
Ternary Store. 





X pulse in the selected word 






Vi pulse in the selected word 

Y 2 pulse to write qO 



Yz pulse to write a 2 




Yz pulse to write a 1 


Figure 5. Writing Pulses in the Ternary Store. 

Reading is performed by sending a pulse 
along the Yj-wire of the selected word. If a 0 
or a 2 are stored, responses would be obtained 
as illustrated in Fig. 6. If the specimen was 
demagnetized — a 1 stored — ^the readout wire 
would not detect any change of flux and no re- 
sponse would appear. 

We shall use the following Postian variables : 

TABLE 4 


Postian 

variables 

Associated physical 
magnitudes 

X, yi, y2 

Currents in the corresponding in- 
put wires. 

m 

Remanent magnetization state of 
the thin-film. 

s 

Normalized voltage output from 
readout wire S. 


Then we may construct the truth-table of the 
store (Table 5). 


TABLE 5 


Operation 

X 

yi 

y2 

m" 

m“+i 

s 


0 

1 

0 

0 

0 

0 

Write 0 

0 

1 

0 

1 

0 

0 


0 

1 

0 

2 

0 

0 


0 

1 

1 

0 

0 

1 


0 

1 

1 

1 

0 

2 


0 

1 

1 

2 

2 

2 


1 

0 

0 

0 

1 

0 

Write 1 

1 

0 

0 

1 

1 

1 


1 

0 

0 

■2 

1 

2 


1 

0 

1 

0 

0 

0 

Read 

1 

0 

1 

1 

1 

1 


1 

0 

1 

2 

2 

2 


1 

1 

0 

0 

0 

0 


1 

1 

0 

1 

1 

1 


1 

1 

0 

2 

2 

2 


1 

1 

1 

0 

0 

1 


1 

1 

1 

1 

1 

1 


1 

1 

1 

2 

2 

1 


2 

n 


0 


2 

Write 2 

2 

n 


1 




2 

B 


2 




2 

n 

n 

0 

0 

0 


2 

B 

B 

1 

2 

0 


2 

1 

1 

2 

2 

1 


Other combinations of X, Yi and Y2 pulses do 
not occur in the rings, and therefore they may 
be considered as forbidden or don’t care terms. 
From the truth-table, the canonical forms of 
m"+* and s follows immediately. 
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In 6.3, a method for transferring the stored 
information to a MVs register shall be de- 
veloped. 

4. MINIMIZATION OF POSTIAN 
FUNCTIONS 

Minimization can be performed by an exten- 
sion of the Quine method currently used for 
Boolean functions. The Absorption law may be 
here expressed as 

U Jk(x) f = f 

ke {0,1,2} 

where f stands for any Postian function. 

In this way it is possible to obtain a set of 
prime implicants. Tabulating them against the 
function minterms, we may determine the 
essential terms, and then choose between the 
nonessential terms a set of prime implicants 
covering the function minterms not included in 
the set of essential terms. 

Redundancies (forbidden or don’t care 
terms) are treated as in the Boolean case, and 
a minim-al form of the Postian function is ob- 
tained. From this simplest form it is an easy 
matter to draw the corresponding logical 
diagram. 

As an example, we shall proceed to minimize 
the canonical form of m"+’ derived from truth- 
table 5. 

First of all, the 3^ four-variable minterms are 
grouped as in Table 6 a,b. 


TABLE 6 (a) 


m"+^ — 0 

m"+i = 1 

m"+> = 2 

0100 

1000 

0112 

0101 

1001 

1012 

0102 

1002 

1102 

0110 

1011 

1112 

0111 

1101 

2100 

1010 

1111 

2101 

1100 


2102 

1110 


2111 

2110 


2112 


TABLE 6 (b) 


F B 

0000 

1020 

2000 

0001 

1021 

2001 

0002 

1022 

2002 

0010 

1120 

2010 

0011 

1121 

2011 

0012 

1122 

2012 

0020 

1200 

2020 

0021 

1201 

2021 

0022 

1202 

2022 

0120 

1210 

2120 

0121 

1211 

2121 

0122 

1212 

2122 

0200 

1220 

2200 

0201 

1221 

2201 

0202 

1222 

2202 

0210 


2210 

0211 


2211 

0212 


2212 

0220 


2220 

0221 


2221 

0222 


2222 


Note that for the combinations of x, yi, y^, m" 
values indicated by the minterms of group 
m^+i — 0 the functions take the value 0. In the 
same way, for groups m"+^ = 1 and 2, the func- 
tions take the values 1 and 2 respectively. 
Finally, the group F B is composed by all for- 
bidden combinations. 

Next, the following steps are performed : 

1) Each minterm of group m“+i 1 is 
checked with every other in the same 
group or in group F B in order to apply 
the Absorption law. 

2) Each minterm of group m“+^ = 2 is 
checked with every other in the same 
group or in group F B in order to apply 
the Absorption law. 

3) With a similar purpose, each minterm of 
group F B is checked with every other of 
the same group. 

In Table 7 are listed all the functions gener- 
ated by one application of the Absorption law to 
all the four variable minterms. 
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Those functions arising from steps 1), 2) 
or 3) are distinguished by adding a 1, 2 or f 
respectively. 


TABLE 7 



These marked functions are checked as in 1), 
2) and 3) for a new application of the Absorp- 
tion law. In Table 8 are listed all the functions 
resulting from two applications of such law, 
with a similar indication for those arising from 
marked functions. 


Of course, those functions not discarded in 
the first absorption process must be considered 
as prime implicants. Here the only prime im- 
plicant generated at this stage is (1102). 


TABLE 8 



An exploration of the new marked functions 
for a third application of the Absorption law 
does not give us any result. Therefore, all the 
functions of Table 8 marked 1 or 2 are prime 
implicants of the original function. 

Finally, in Table 9 a, b the prime implicants 
are tabulated against the corresponding func- 
tion minterms in order to get subgroups of 
prime implicants covering all the minterms in 
each case. 


(a) TABLE 9 (b) 


m 

n 

1 


1 

n 

11 

1 

■ 



1 

1 

2 

2 

2 

2 

2 



0 


a 

II 

II 

1 




1 

1 

1 

1 

1 

1 

1 



□ 


1 

y 

n 

1 




E 

1 




u 


■ 


1 


1 

Q 

y 

1 

■ 


1 

1 

2 

2 




B 


0000 

X 

X 

X 





0012 

g 

g 

1 

B 

1 

1 

1 

1 

B 

0001 


X 


X 




2000 

1 

1 

1 

■ 

B 

B 

B 

1 

1 

1001 





X 

a 

2001 

■ 

1 

■ 

■ 

1 

B 

1 

B 

1 








2002 

■ 

1 

I 

■ 

1 

B 

B 

B 

B 








1102 

■ 

1 

IB 

IB 

B 

fl 

B 

r 

□ 









ON THE ANALYSIS AND SYNTHESIS OP THREE-VALUED DIGITAL SYSTEMS 


469 


This last step produces the following minimal 
form: m"+i = (1102) + (0012) + (2000) + 
(2001) + 1 ((0OO0)-t-(0O01) + (1001)). 

5. THE SYNTHESIS PROBLEM. COMBI- 
NATIONAL CIRCUITS 

5.1 Introduction 

The matter of obtaining an electrical model 
of a given function is currently described as 
synthesis. We shall consider combinational cir- 
cuits first. 

Of course, synthesis is not a problem with a 
single answer. As in binary synthesis, the first 
task is to design a few basic circuits, each one 
performing some particular operation. The set 
of operations implemented must be complete, in 
the sense that the entire algebra could be gen- 
erated from them (see Ref. [9]). In order to 
get reliability, we shall impose an “on-off” per- 
formance of the transistors used in the cir- 
cuitry. Reshaping of the signal in every active 
stage will be secured by using common emitter 
connections. 

5.2 Possible Operations 

Let us briefly discuss our possibilities. For 
PNP transistors w’e have the fundamental cir- 
cuit indicated in Fig. 7. The points c, e admit 
the following voltage combinations (Table 10) . 


TABLE 10 


c 

e 

—6 
— 6 

0 

0 

+6 

+6 



Figure 7. Fundamental Connection for the 
Active Stages. 


The value of the bias resistance defines the 
behavior of the circuit as Type I or II, accord- 
ing to the following table (Table 11). 


TABLE 11 


b 

Transistor 

Type I 

Type II 

+6 

off 

off 

0 

off 

on 

—6 

on 

on 


Therefore, the following truth-tables can be 
written : 


TABLE 12 


c 

e 

Type 

Input 

Output 

Postian 

Function 

2 

1 


0 

2 


2 

1 

I 

1 

2 

1 + ' 32 (b) 

2 

1 


2 

1 


2 

1 


0 

2 


2 

1 

II 

1 

1 

1 -h Jo(b) 

2 

1 


2 

1 


2 

0 


0 

2 


2 

0 

I 

1 

2 

J2(b) 

2 

0 


2 

0 


2 

0 


0 

2 


2 

0 

II 

1 

0 

Jo(b) 

2 

0 


2 

0 


1 

0 


0 

1 


1 

0 

I 

1 

1 

lJ2(b) 

1 

0 


2 

0 


1 

0 


0 

1 


1 

0 

II 

1 

0 

1 Jo(b) 

1 

0 


2 

0 



We shall implement “And” and “Or” gates 
by means of conventional diode circuits. The 
so called “Constant Functions” 0, 1, 2 are of 
course generated as their corresponding voltage 
levels. Two properly selected functions of Table 
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12, together with the previous ones, allow the 
generation of the entire algebra. 

A convenient pair appears to be Jo and Ja, 
leading to simple expressions for Ji and J 2 , 
such as 

JaCx) =J„(J 2 (x)) _ 

Jl(x) = Jo(Jo(x) 

As we see, the system of operations (Jo, Ja. 

, + ) is certainly complete. However, because 
of electronic reasons that will become evident 
later, the operation 

K(x) = 1 J 2 (X) 
will be also considered. 

Any Postian function can then be systemati- 
cally generated. In order to get reshaping of 
the signal after every degenerative stage, the 
complements of the minterms (kika. . . kn) are 
sometimes generated, either using deMorgan’s 
Laws or the relation 

Jo(Minterm) = Minterm 
Also we shall take advantage of the relation 
K (Minterm) = 1 Minterm 
Note that Jo and Ji can be generated as 

£o(x) = J 2 (£o(x))_ 

Ji(x) = Jo(Jo(x) J 2 (x)) 

5.3 Example 

The function 

Jo(x) Ji(y) + 1 J 2 (x) JaCy) 

is realized as in Fig. 8. 

5.4 Implementation 

Fig. 9 illustrates the electronic circuits per- 
forming the set of selected operations. 

6. THE SYNTHESIS PROBLEM. SEQUEN- 
TIAL CIRCUITS 

6.1 Introduction 

The Tabular and Matrix techniques^*- “ used 
in sequential Boolean synthesis can be easily 
extended to Postian systems. 

As a basic sequential unit, the electronic cir- 
cuit of Fig. 10 was developed. The analysis of 
the circuit leads to the following truth-table, 
where n indicates the order number of clock 
pulse : 


JoX- 

JlY- 


- ^ JoX4-ilY ■ JoX Ji Y j 


JaX Ji Y P s, 

JoX 1 vbXsJzY 

1J2XJ2Y ^ 


J 2 X- 

Jiy- 




J2X+J2y*j2XJ2YL 


Figure 8. An Example of Realization of a 
Two Variables Postian Function. 



Figure 9. Electronic Diagram of the Basic 
Combinational Circuits Selected. 


TABLE 14 


rn 

s" 


qn+l 

0 

2 

2 

0 

2 

0 

2 

1 

2 

2 

0 

2 

2 

2 

2 

q" 
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Figure 10. Electronic Diagram of a “3-input” Tristable 
Multi-vibrator, Selected as the Basic Sequential Unit. 

All other input combinations are supposed 
forbidden. So we shall have, after a convenient 
minimization process, the following character- 
istic equation : 

qn+i ^ 1 (000) (000) + (222) 

in the independent variables r", s", t“. 

For obvious reasons, this circuit will be called 
a “3-input’’ Tristable Multivibrator. It repre- 
sents some sort of generalization of the well 
known RS flip-flop. 

By combining “3-input” MVs. with the com.- 
binational units already described, any sequen- 
tial Postian function can be synthetized. In the 


0 

1 

2 

i 

|do| 


01 

1 

Jo 

0 

1 

2 

Jo 

5 

01 

Jo 

0 

1 

2 

Jo 

n — * 


01 

Jo 


Figure 11. Logical Diagram of a Shift Register. 


following paragraphs some examples are given 
on the matter. 

6.2 Other Sequential Circuits 

By a proper connection of its inputs and out- 
puts, a 3-input Tristable Multivibrator can be 
made to behave either like a “Cycling” MV or 
like a “Delay” MV, whose truth-tables are indi- 
cated in Table 15. 

TABLE 15 


By applying the tabular method of synthesis, 
it is an easy matter to obtain the input equa- 
tions for the 3-input MV : 


where the variables of the minterms are i, q. 

Another point of interest is the design of 
shift registers. From a logical point of view, a 
shift register is merely a chain of delay MV. 
To take advantage of the fact that in our 
3-input MV we have the three J-f unctions on 
the output, we may use the connection illus- 
trated in Fig. 11. 

The fact that any MV may be changing while 
influencing the next, is taken care of by the 


(b) 

Delay 


i 

q“ 

q"+i 

0 

0 

0 

0 

1 

0 

0 

2 

0 

1 

0 

1 

1 

1 

1 

1 

2 

1 

2 

0 

2 

2 

1 

2 

2 

2 

2 


(a) 

Cycling 


i 

q" 

q"+’ 

0 

0 

1 

0 

1 

2 

0 

2 

0 

1 

0 

0 

1 

1 

1 

1 

2 

2 

2 

0 

2 

2 

1 

0 

2 

2 

1 

i 


Simulation of 
Cycling MV 


r = (02) (21) 
srr OOO) W) 

t=(0T)W) 


Simulation of 
Delay MV 


r = (00) = J„q 
s= (10) = Jiq 

t= (m =3^ 
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delay introduced by the time constants of the 
shifter circuits. (See Fig. 10.) 

6.3 Other Examples 

Let us design a decade counter by the Tabular 
method. The left part of Table 16 represents the 


Application Equations in the form of a truth- 
table giving the desired evolution of the out- 
puts of the counter multivibrators. On the right 
we have written suitable combinations of the 
inputs, taken from Table 14. 


TABLE 16 



Then the Input Equations can be easily ob- 
tained. Once minimized, they adopt the form 

To = (022) r, = 102^ r^ = (W) 

So = 7® Si = TO^ S2 = (000) 

to = 2 t, = 7^2) ta = (001) 

The resulting logical diagram is illustrated in 
Fig. 12. 

As a final application of this method, we shall 
develop a circuit to transfer a word of the thin 
film store described in 3.2 to a Register of 3- 
input MVs. The required truth-table may be 
simplified, by proper use of redundancies, to 
(Table 17) 


TABLE 17 


Operation 

X 

yi 

72 

qn+1 

Reading 

1 

0 

1 

m“ 


0 

0 

0 

q“ 

Writing 

2 

0 

2 

q» 


MVO 


MV1 


MV2 


J2(-6V) 


1 

p 


± 

H 

n 


— 


Jo 

P 

0 1 

2 

1 - 

-=lT^ 




.3- 

Jo 

p 

L 

- O 

2 


E 






0 

1 

0 

1 


Figure 12. Logical Diagram Showing a Base Three 
Decade Counter. 
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store 



(egister 


Figure 13. Concerning to the Transfer of Information 
from the Thin-Film Store to a MVs. Register. 

That is to say, each MV copies the state of 
its associated ring only while reading. 

After obtaining and simplifying the input 
equations, we shall have the logical diagram 
indicated in Fig. 13. 

7. DESIGN OF A TERNARY FULL ADDER 

Kilbum et al. have developed a fast carry- 
propagation binary adder, now incorporated 


in the Arithmetic Unit of ATLAS Ferranti 
Computer. Let us synthesize the ternary ver- 
sion of such system. 

The truth-table of the ternary sum is indi- 
cated in Table 18. 


TABLE 18 



110 2 0 
1110 1 
112 11 


1 2 0 0 1 
12 111 
1 2 2 2 1 



Carry pro^Qation to^ic 
Staqc \ to Z 


The case c = 2 is not included because the 
propagation of any carry value other than 0 
or 1 is not possible in a full adder, whichever 
the numerical base used. 

From the table, the canonical forms of c', s 
ar obtained, and after a minimization process, 
we have 

c' = fa(x,y) c-j-fb(x,y) 
s =1 (fe(x,y) J„(c) -l-fd(x,y) Ji(c) 

-f- fa(x,y) J„(c) -1- fc(x,y) Ji(c) 

where 

fa(x,y) = (02) -f (11)+ (20) 
fb(x,y) = (12) + (21) + (22) 
f+x,y) = (01) + (10) + (22) 
f<i(x,y) = (00) + (12) + (21) 


Figure 14. Block Diagram of the Kilburn Adder. 
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■le 



of a Typical Stage, Including Normalization of 
Voltage Levels. 


With the logic illustrated in Fig. 14 the carry- 
propagation time is reduced to the sum of the 
switching time of the slowest carry-path tran- 
sistor and the time needed for the carry pulse 
to travel along the carry-path. 

Because of electronic reasons, another corre- 
spondence between Postian a'nd electrical values 
is adopted on the carry-path circuit, namely, 0 
volt for the “1” and — 2 volts for the “2”. 
Therefore, those circuits analyzing carry-path 
information must be designed to match such 
correspondence. 

The electronic diagram of a typical carry 
propagation stage is illustrated in Fig. 15. 

The sum s is easily synthesized by the method 
described in 5.3, leading to the gate circuit of 
Fig. 16. The storage of s in a MVs. register is 
also indicated. 

If the Arithmetic Unit is of the serial-parallel 



Figure 16. Generation and Storage of the Sum. 


JoX JfX 


Joy Jiy J2Y 
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-b 



To first staq« 


Figure 17, Carry Feedback on a Serial-Parallel Adder. 

type, the carry generated at the final stage at 
time n must be stored in a Delay MV in order 
to incorporate it as input carry at the least 
significant digit stage at time n + 1- Because 
of the binary character of the carry propaga- 
tion, the carry storage may be performed by 
means of a conventional binary Delay MV, as 
illustrated in Fig. 17. 
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AN ALGORITHM FOR PLACEMENT OF INTERCONNECTED 
ELEMENTS BASED ON MINIMUM WIRE LENGTH 

R, A. Rutman 
AC Spark Plug Division 
General Motors Corporation 
El Segundo, California 


INTRODUCTION 

The efficient placement of components has 
taken on increased importance because of micro- 
miniaturization techniques. Packaging tech- 
niques are permitting higher density of compo- 
nents. As the components are mounted closer 
and closer, less and less room becomes available 
between components for interconnections. Un- 
der these circumstances, the layout of circuitry 
becomes a difficult and time consuming task for 
the designer. One approach often used by de- 
signers is to group together elements which are 
functionally related so as to reduce the length 
of interconnections. If the designer could place 
the elements in such a way that every element 
connects only to its nearest neighbors, the prob- 
lem of routing the interconnections would be 
somewhat simplified. 

In general, the shorter the wires the easier 
they are to lay out. Furthermore, in the special 
case of a printed circuit board, there is a limit 
to the total length of etched wires that can be 
printed on one layer. 

Those considerations led to the concept that 
a placement of components which will minimize 
the total length of the wires would simplify the 
layout. With that goal in mind, this paper de- 
scribes a technique for the automatic placement 
of components. 

1. Description of Problem 

The motivation for attacking the problem of 


placement arose from the author’s association 
with a com.puter development project at AC 
Spark Plug. In that project, the logic mecha- 
nization of a computer used micrologic elements 
mounted on multi-layered boards. Several lay- 
ers of etched circuitry provided the necessary 
interconnections. 

Although the work done on this problem was 
for a specific application, it is felt that some of 
its features are usable in broader applications. 
Consequently, definition of some fundamental 
terminology is hereby given and generalized as 
much as possible. 

A board is defined as a finite array of posi- 
tions. A position is defined as a point in an x-y 
coordinate system. Elements are movable en- 
tities which may be placed in the given posi- 
tions. 

In these terms the problem is to place a given 
set of elements into positions on a board so as 
to minimize the total length of the interconnect- 
ing wires. 

A major portion of this paper deals with the 
application where only one element per position 
is allowed. The technique for allowing more 
than one element per position is given in Sec- 
tion 7. 

2. The Technique of Solution 
a. Steinberg’s Algorithm’ 

A set of elements which have interconnec- 


477 



478 


PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 


tions determined by the electrical design is 
given. Assume these elements are placed on the 
board in some initial positions with an initial 
wire length. Furthermore, assume that some of 
these elements are not directly connected to- 
gether. In fact, unconnected sets of elements 
form an important part of the solution. The 
technique of solution consists of selecting un- 
connected sets and moving the elements within 
each set to reduce the wire length. 

An unconnected set can be formed in the 
following manner. First select any element on 
the board. Then choose any element which is 
not directly connected to this first element and 
add it to the set. Then choose a third element 
which is not connected to the first two elements, 
etc. The set is called maximal when the selec- 
tion is carried to the point where no more ele- 
ments can be added. Clearly there are many 
ways in which maximal unconnected sets can 
be formed. In particular, a family of uncon- 
nected sets can be formed such that every 
element is in at least one set. Section 5 contains 
more details on the formation of unconnected 
sets. 

Starting with an initial placement of elements, 
in board positions, take some unconnected set 
of the elements (possibly a maximal uncon- 
nected set) and remove it from the board. A 
set of positions on the board are now vacant. 
If there were V vacant positions before removal 
of the unconnected set, now there would be 
V 4- N vacant positions, where N is the number 
of elements in the unconnected set. 

Next, take one element from the unconnected 
set and place it in one of the vacant positions. 
The total length of the wires which are attached 
to the element in this position can be determined. 
This length is independent of the placement of 
any of the other elements in the unconnected 
set, since the given element is not connected to 
any of them. Now place this element in each of 
the vacant positions and determine the wire 
length for each position. Then do the same for 
each element in the unconnected set. This gives 
an array of N (N-j-V) values. 

The next step is to determine a placement, of 
the unconnected set in the vacant positions so 
that the total wire length for all the elements in 
the unconnected set is as small as possible. To 


do that there exists an algorithm which guaran- 
tees a minimum placement of the unconnected 
set. See Section 6 for details. 

Notice that the total wire length between 
elements which are not in the unconnected set 
remains constant during this manipulation since 
they remain fixed on the board. 

After manipulation of this unconnected set, 
the worst that can happen is that there is no 
change in the total wire length (perhaps, all 
elements in the unconnected set end up where 
they were originally). However, if there exists 
a better placement of the unconnected set, the 
algorithm will find it, and the total wire length 
will decrease. 

The process which has just been described 
will be referred to as an iteration. The next 
step in the algorithm is to choose another un- 
connected set and repeat the process. The 
algorithm continues through many such itera- 
tions until the total wire length no longer de- 
creases or decreases an insignificant amount. 

b. Improvements on Steinberg’s Algorithm 

(1) Improvement in the method of measur- 
ing wire length. The original Steinberg al- 
gorithm uses a connection matrix (Cij). For 
example, if element Ci is connected to element ej 
by three wires, term Cy of the connection matrix 
is equal to three. This method of expressing 
connection with integers is perfectly adequate 
in some cases. But in other cases, a fractional 
description of the connection is desirable. The 
author’s interpretation of Steinberg’s algorithm 
based upon examples^ is that assumes only 
integral values. Consider the example in Fig- 
ure 1. If only integers are permitted, these two 
dilferent connections would be represented by 
the same values in the connection matrix and 
hence would be treated identically in the place- 
ment process. 

For the measurement of wire length, the 
placement program described in this paper 



Figure 1. 
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makes a distinction between these two cases 
and is therefore able to converge on a more sat* 
isfactory solution for certain types of connec- 
tions. The measurement of wire length is 
treated in more detail in Section 4. 

(2) Interchange. The original algorithm 
provides a powerful method for manipulating 
elements which are not connected together. 
Sometimes elements in connected sets can be 
manipulated to advantage. Consider the exam- 
ple in Figure 2. 

If only the transposition A< — could occur, 
a more desirable placement would be obtained. 
The basic algorithm allows interchange only of 
elements which are in the same unconnected 
set. Because A and B may never be in the same 
unconnected set (since they are connected), 
the basic algorithm would not resolve this diffi- 
culty. In order to improve the basic algorithm, 
the placement program also evaluates the inter- 
change of all elements which are directly con- 
nected together. 

(3) Reducing the length of the longest wires. 
In the basic algorithm, groups of elements 
which are tightly connected to one another are 
reluctant to move. 

This is best seen by example. Suppose the 
placement problem consists of three elements, 
A and B movable, and X fixed (X might be a 



Figure 2. 
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Figure 3(a). 
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Figure 3(b). 



Figure 4. 
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Figure 5(a). Initial State. 

connector pin, for example), connected and 
placed as shown in Figure 3(a). Assume lots 
of vacant positions. Suppose B is in the first 
unconnected set. Then, at the end of the first 
iteration, the result would be as shown in Fig- 
ure 3(b). 

If A is in the second unconnected set, A can 
wander into any one of four positions about B 
where the total wire length is equivalent. The 
difficulty is that the group (A, B) will not tend 
to migrate toward the fixed element X. So 
minimum total wire length is not reached ! The 
element X need not be fixed. For example, the 
group A, B, C, and D shown in Figure 4 are all 
movable, yet will not come together unless by 
chance. 

A partial solution to this problem is obtained 
in the following way. The algorithm is first 
applied to the task of minimizing the lengths 
of the longest wires, and later it is applied to 
minimizing total, wire length. The former task 
is referred to as Phase I of the placement pro- 
gram, and the latter is referred to as Phase II. 
The diagrams in Figure 5 illustrate what hap- 
pens to the example given in Figure 3 as at- 
tempts to minimize the lengths of the longest 
wires are made in a Phase I placement. 

Note that in this process the total wire length 
sometimes increases. In the example, Phase I 
alone converged to the minimum wire length 
and Phase II would produce no further improve- 
ment. In general, however. Phase II is needed 
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0 O 

Figure 5(b). B belongs to first unconnected set 
and moves. 

0=0 © 

Figure 5(c). A belongs to second unconnected 
set and moves. 

0 = =© © 

Figure 5(d). B moves. 



Figure 5(e). A moves. The result is one of the four 
possible permutations of A. 

QHSy® 

Figure 5(f). B moves. 

to make the fine adjustments; Phase I makes 
the coarse adjustments. The exact function used 
to measure the length of the longest wire in 
Phase I is described in detail in Section 4. 

3. Co7nputer Results and Running Times 

To test these techniques the placement pro- 
gram was run using two computer boards. The 
first, referred to as the small board, contained 
70 elements and 77 positions. The second, re- 
ferred to as the large board, contained 516 
elements and 550 positions. To evaluate the 
results of each placement, the minimum point 
to point wire length was computed using the 
algorithm of Loberman and Weinberger.^ 

In the following discussion, an initial place- 
ment is described as either a random initial 
placement or a manual initial placement. The 
former was determined by use of a random 
number table. The latter was prepared by a 
skilled designer. 

Because of the larger amount of computer 
time required to place the large board, the small 
board was used for most of the comparisons. 

a. The Results of the Original Steinberg Al- 
gorithm 

The original Steinberg algorithm^ when ap- 
plied to the small board with a manual initial 


placement resulted in a 4.7 % reduction in total 
wire length. 

b. The Effect of Different Wire Lrength Meas- 
urement 

If every pin were connected to only one other 
pin, the method of measuring wire length de- 
scribed in this paper would be identical to that 
of Steinberg's. If a large percentage of the pins 
are connected to more than one other pin, the 
method in this paper would have a pronounced 
effect. In the particular case of the small board, 
a majority of the pins are connected to only one 
other pin. Thus the percentage of improvement 
over the manual initial placement is only 5.3%. 
That is, if the original Steinberg algorithm is 
used with no change other than in the measure- 
ment of wire length, the improvement is 5.3%. 

c. The Effect of Interchange 
Investigation is still underway to determine 

how often an interchange should be performed. 
Possibly an interchange should be done every 
iteration. 

As mentioned above, there is a 5.3% improve- 
ment over the manual placement with no inter- 
change. With an interchange once every 20 
iterations, there is a 6.7 % improvement. With 
an interchange every 10 iterations, there is an 
8.2% improvement. The improvements were 
measured using the Loberman and Weinberger 
technique.^ Figure 6 illustrates the effect of 
interchange upon reducing pseudo wire length. 
Two runs were made on the small board. One 
run was without interchange and one was with 
interchange every 20 iterations. 

d. The Effect of First Reducing Longest Wire 
Length 

Starting with a manual placement of the 
small board and going through Phase I and 
Phase II with an interchange every 10 itera- 
tions, the result is a 19.4% improvement over 
the manual placement. 

Starting with a random initial placement and 
going through Phase I and Phase II with an 
interchange every 20 iterations, the result is a 
17.8% improvement over the manual placement. 

e. Improvement on the Large Board 
Starting with a manual initial placement of 

the large board and going through Phase I and 
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Figure 6. Small Board (Phase II). 


Phase II with interchange approximately every 
10 iterations, there was a 39% improvement 
over the manual placement. 

f. Computer Running Times 

For the placement' of the small board, the 
computer was permitted to run to completion 
and required a total of approximately 4 minutes 
of IBM 7090 computer time. Notice that most 
of the improvement occurs in the first few 
iterations. In the placement of the large board, 
there was some question as of when to cut off 
execution of the program. The program was 
run for 2 hours 10 minutes in Phase I and for 
2 hours in Phase II. However, both phases could 
have been stopped sooner with little loss in im- 


provement. This is clearly illustrated in Fig- 
ures 7 and 8. 

4. The Technique for Measuring Wire Length 
The length of wire needed to interconnect a 
set of points depends upon the manner in which 
the routing is done. For a simple connection, 
say between point A and point B, it is assumed 
that the length of a connection is the dog leg 
distance between A and B. This is adequate for 
simple connections between two points, but as 
the number of points gets greater the deter- 
mination of wire length becomes more and more 
complex and depends largely on the specific 
point to point wiring scheme. For example, con- 
sider the case of five points A, B, C, D, and E 
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connected by one conductor as shown in Fig- 
ure 9. 

Each of the routings in Figure 9 has a differ- 
ent wire length. Steinberg,^ in placing element 
E, would treat the wire length as in routing 2. 
In treating element A, he would treat the wire 
length as in routing 4. 

The problem is still more complicated for 
printed circuits since connections can be made 
from point to wire in addition to point to point 
as shown in Figure 10. 

There does not seem to be a simple technique 
for computing the minimum wire length for all 
connections of this type. 


Loberman and Weinberger^ have an al- 
gorithm for computing the minimum point to 
point wire length, but it is too slow to use in 
the placement program. 

The placement program for the most part 
does not have to evaluate the actual wire length. 
But it must provide the answer to such questions 
as the following. 

If elements A, B, C, D, and E are connected 
together by one conductor and if A, B, C, D are 
fixed, with E able to occupy either position x or 
position y, what is the difference in the total 
wire length of the conductor if E is in position x 
compared to that of position y? Hence, of prin- 
cipal concern is the change in wire length from 



Fij^ure 7. Large Board (Phase I). 
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Figure 8, Large Board (Phase II). 

one position to another rather than in its abso- 
lute value. 

The remainder of this section deals with the 
exact function used to evaluate wire length. 

Two pins are defined to be connected if they 
are electrically connected. The fact that pin A 
is connected to pin B forms a relation between 
pin A and pin B. It is easy to verify that this is 
an equivalence relation. Elements e^ and ej are 
said to be connected if there exists a pin c and 
a pin q € ej such that pin p is connected to pin q. 
Consider the set of all pins in a given problem. 
Let connection be the equivalence relation on 
this set. This equivalence relation will divide 
the set of all pins into disjoint equivalence 


c c c c 

Routing 1 Routing 2 Routing 3 Routing 4 



Figure 9. 



Figure 10. 
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classes. Each of the equivalence classes so 
formed is called a conductor. 

Let S = [ e,, e:;, . , ei, . ef} be the given set of f 
elements. 

Let L = I lu /■,, L . , . , /,.} be the given 

set of g positions (locations) where g — f . 

Let P = { Pi, Pl*, • . . , P»i, Ps} be the set of all 
pins on all elements. 

Let C = f Cl, C-, . , Cj, . , Cl,} be the set of con- 
ductors. 

Each conductor is a subset of the set P. Also, 
each element e-, is a subset of the set P. Pin 
p,„ £ ei if pin p,„ is a pin on element e;. 

Now define the notion of a transductor Tj 
which is a set of elements connected together 
by the conductor Cj. 

Tj = ( Ci ; p,„€Cjandpn.£ei}. 

Notice that there are exactly as many trans- 
ductors as conductors and that there is a natu- 
ral mapping from conductor to transductor. 
Given a conductor Cj, merely consider each 
p„, c Cj in order. Given a p„„ there exists an ej 
such that p,» £ Cj. Therefore, the corresponding 
Oj belongs in the set Tj. Let a (Tj) be defined as 
a function which maps Tj into its cardinal num- 
ber, the number of elements in the set. 

Also define a one to one function y whose 
domain is the set S of elements and whose range 
is the set L of positions, y (e,) is the present 
location of element ei. Each position in the set 
L has two components: the x coordinate and 
the y coordinate. Thus, yx (ei) is the x coordi- 
nate of element ei, and yy (ei) is the y coordi- 
nate of element e,. 

The first step in the placement process is to 
assign all elements to random initial positions. 
After removing an unconnected set from the 
board there is a set M of available positions. 
M is a subset of L. Suppose e-, is in the uncon- 
nected set. At this stage of the placement proc- 
ess, Cj may be placed in any position of the set 
M. The placement process uses a number called 
the pseudo wire length Fj.,, for each position 
/„ £ M which indicates how well element likes 
to be in position /„ (assuming every element 
that Cj is connected to is not movable) . A com- 
paratively low number means that the wire 
length is low when e^ is in position /n. A com- 


paratively high number means that the wire 
length is high when e, is in position Phase I 
and Phase II (as defined in Section 2) use dif- 
ferent functions for the pseudo wire length. 
The functions are referred to as Fui and F/ji, 
respectively. 

At each step in the placement process the 
total pseudo wire length can be evaluated as 
follows. Suppose the current placement puts 
element ei in position t^n)- Then the total 
pseudo wire length is defined as ? Fi.,>(i). The 
object of the program is to find ‘a placement 
which will minimize 2 F j » . 

Abstractly, Fi.„ is a function of y(ek) and 
yie-x) for all k such that Ck £ Tj and k ^ i and all 
j such that Ci £ Tj and Q:(Tj) >1. Element Ci is 
assumed to be in position having x, y coordi- 
nates /.ix-and /„y. 

Fli the Phase II pseudo wire length number 
for element ei when in position /„, is given by : 


T?P = s - ^ 

I., n — / nn \ 1 ^ 

4 ill j a ( 1 j) 1 all k 

+ y,v (Ck) — („y 1 ) where e,. 
and a (T,) > 1. 


(Ivx (Sk) /ii,- I 

£ Tj, ek £ Tj k ^ i, 


To get an expression for F*„ the phase I 
pseudo wire length number, it is necessary first 
to define an auxiliary function Fk„*j for three 
separate cases. 


Case 1 : a (Tj) = 2 

Fi.i, = i yx (Ck) Jiix j ~l" i y.v (®k) /ny | 

where Tj = { ek, e,}. 

Case 2: a (Tj) = 3 

Fk„*j = max { ( I yx (Ck) — | + | y,. (Ck) — 

/ny , ) J ( ; yx (Giii) /iix j “h | y.v (^in) /ny } ) } 

where Tj = { e.,„ ek, e,}. 

Case 3: a (Tj) > 3 

For this case a pseudo location //I is first de- 
fined, its X and y components given by : 

/i.x*^ = (®k) where ek £ Tj and k # i. 

a (Tj) k 

/ix*^ = TW-r yy (ek) where ek £ Tj and k ^ i. 

a ( Ij) k 

Furthermore, let a function G,*j be given by : 

G *j = ^ V 

a(Tj)— 1 „iTk 

( 'yx (Ck) — /ix*^ I + i y,- (ek) — /»'> | ) 
where e Tj and k ^ i. 
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Then Fi,„*j can be expressed as : 

T? *1 = r ^ 2 

a (Tj) —1“"" 

( I yx (eO — /„x I + I yy (ej — /„y | )] — 

where e^ e Tj and k ^ i. 

Now the phase I measure F/.n for wire length 
is defined as : 

F,‘„ = max {Fi.„*j} 

where ei £ Tj and a (Tj) > 1 . 

One of the advantages of choosing the func- 
tions in this manner is the ease of computation. 
Another is that they tend to closely approximate 
the real situation. As an example, consider the 
element ei which has only one transductor Tj. 
Suppose a (Tj) = 2. Then F“„ is merely the 
dog leg distance between the two elements con- 
tained in Tj. 

Suppose ei is contained in two transductors 
Ti and T2 such that a (Ti) = a (T2) = 2 , Tj = 
{ Ok, ei }, and T2 = { Cn,, Ci }. That is, Oj is con- 
nected to two different elements as shown in 
Figure 11 . 

Then F”„ would be equal to seven which is the 
length of the longest wire. Suppose we also had 
Tg = { Cn,, ei } with a (T3) = 2 as shown in 
Figure 12 . 



Length = 4 Length - 7 


Fi^re 11. 

Length = 4 Length = 7 


Figure 12. 
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Here again F^n = 7 since = Fi,„*2 = 4 
and Fi,n*^ = 7 . F"„ for the case in Figure 11 
would be equal to 11, the length of both wires. 

F-^n for the case in Figure 12 would be equal 
to 15 . 

For another example, suppose ei belongs only 
to transductor Tj where a (Tj) =5 and Tj = 
{ ei, e2, e^y 64, 65 }. Then Fi” can be interpreted 
as the average distance from element ei to the 
other elements in Tj. Suppose M = { fu I2 } 
where /i and (2 are one unit apart, and both 
positions are at some distance from the other 
elements in Tj as shown in Figure 13 . 

Then F/;i — F “2 = 1 which is desirable be- 
cause the wiring would be done in a manner 
similar to Figure 14 below, and the difference in 
actual wire length between ei in h and ei in 
is only 1 unit. 

Suppose positions A and (2 and elements 62, 
63, 64, and e5 are located as shown in Figure 15 
below, that is, interior to the set. 

Then FiA — Fi,2 = 0. That is, the two posi- 
tions are equivalent and ei may be placed in 
either position with no difference in wire length. 
Once again the function approaches the real 
situation. 

For the case shown in Figure 5 , F14 = 
FI2 = 0. F^i would be greater than zero only 
if /i were a position located outside the rec- 
tangle formed by e2, e3, e4, and as in Figure 
11. For Figure 11, F{.i — F/ ^ = 1 as would 
be expected. 

© 

© © 

Figure 14. 

© © 

2>1 [a 
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Figure 13. 


Figure 15. 
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In many problems, the majority of the trans- 
ductors are such that a (Tj) = 2. But the case 
where a, (Tj) >2 must be handled consistently. 
Furthermore, it is observed that given any ele- 
ment ei there is a large probability that e-, c Tj 
such that a (Tj) > 2. Table 1 shows the result 
of a survey made on a large computer printed 
circuit board. The board had 729 transductors 
with the distribution as shown, N is the num- 
ber of transductors with the given a (Tj). In 
other words, the sets Tj were sorted into groups 
according to their cardinal number. 

5. The Procedure for Forming Maximal 
Unconnected Sets 

Given a set S of elements ei and a set of trans- 
ductors (a typical transductor being Tj) an un- 
connected set U is defined as follows : 

U { Oi, Ok c S I there is no Tj such that 
Ci c Tj and Ck c Tj }. 

There are many ways in which the set U can 
be formed. It is desirable to form a family U' = 
{ Ui, U2, U3 , . . . , Ur, . . . Uq }. Each member of 
S is contained in at least one of the sets in U'. 

Also, it appears desirable that the uncon- 
nected sets should overlap one another.^ In 
dealing with a large number of elements it is 
not feasible to form all unconnected sets. In- 
stead a technique is used which causes some 
overlap. About of the elements in each set 
will be contained in other sets. Figure 16 is a 
flow chart which shows the technique for ob- 
taining the unconnected set Uj given the sets 
Uo, Ui, U2, . . . , Uj-i. The following nomencla- 
ture is used in Figure 16. 

Eo = The original set of elements which 
are eligible for inclusion in an un- 
connected set. 

E = A working set containing elements 
eligible for inclusion in the current 
unconnected set. 

W “ A working set containing some of 
the elements not eligible for inclu- 
sion in the current unconnected set. 
Elements in W are connected to ele- 
ments which are in the current 
unconnected set Uj. 

U(, = 4>, the void set. 
a(Uj) =: Cardinal number of set Uj. 

N* nr The largest number of elements 


which may ,be contained in any un- 
connected set. This number is a 
function of the number of empty 
positions on the board and the 
amount of avialable space in the 
computer. In short, a given place- 
ment program is restricted as to the 
size of the solution matrix which it 
can solve. By restricting the num- 
ber of elements in an unconnected 
set, the size of the matrix is re- 
stricted. This means that the sets 
may not be maximal in some cases, 
j— 1 

Z = U U„, (the union of sets Ui, U2, 

m=:l 



NOTES: 

(1) Subroutine A consists of Random selection of an e^ from E, placing it in Uj, 
and placing all e^^ connected to e^ in W. 

(2) Given sets A and B then A-t' is defined as the set of elements in A but not in B. 

Figrure 16. Flowchart for Formation of 
Unconnected Sets. 

NOTES: 

(1) Subroutine A consists of Random selection of an 
Cj from E, placing it in Uj, and placing all e,^ 
connected to Cj in W. 

(2) Given sets A and B then A-B is defined as the 
set of elements in A but not in B. 
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6. The Solution Matrix 

Each iteration of the placement program in- 
volves the determination of the optimal place- 
ment for an unconnected set of elements. The 
procedure for determining an optimal place- 
ment involves operations on a solution matrix 
described in this section. The method for 
manipulating the matrix is an adaptation of an 
algorithm by Munkres.^ 

The first step in placing an unconnected set is 
to form a matrix in the follov^ing manner. Sup- 
pose the unconnected set has N elements and 
there is a set of N + V positions available for 
placement. Let the set of availabfe positions be 
indexed by n, with n running from 1 to N -f V. 
Let the unconnected set of elements be indexed 
by i with i running from 1 to N. Now form a 
matrix [A] of N rows and N + V columns. 
Each row i of the matrix corresponds to the 
element ej in the unconnected set and each col- 
umn n of the matrix corresponds to the position 
/n in the set of available positions. The elements 
ai,n of the matrix are set equal to the functions 
Fiji (or Fijn if in phase I of the placement) de- 
fined in Section 4 on measuring wire length. 

The Munkres algorithm^ results in N stars 
(asterisks) being associated with elements of 
the m.atrix [A] , These stars are positioned so 
that every row has exactly one star and no 
column has more than one star. The placement 
of the stars in the matrix describes the place- 
ment of the elements e,. Thus if element a^^g of 
the matrix has a star upon completion of the 
algorithm, then element e^ should be placed in 
position A in the set of available positions. 

The following algorithm is derived from 
Munkres' algorithm. It can handle rectangular 
matrices and includes features to speed up the 
computer solution. 

During the course of the algorithm, some 
zero-elements are distinguished by asterisks 
and some by primes. These are referred to as 
starred zeros and primed zeros respectively. 

In addition to the matrix [A] , the row vector 
Rs is required. Each element of Rs corresponds 
to a column of the matrix [A] . If the column 
contains a zero star, the corresponding element 
of Rs will contain the row number of the zero 
star. If the column contains no zero star, the 
corresponding element of R^, will be zero. 


Similarly, there are two column vectors Cs 
and Cp, whose elements correspond to the rows 
of the matrix [A]. If the row has a zero star, 
the corresponding element of Cs contains the 
column number of the zero star. If the row has 
a zero prime, the corresponding element of Cp 
contains the column number of the zero prime. 

A ‘"line" is defined as a row or a column of the 
matrix [A]. In the course of the algorithm, 
certain lines are distinguished and referred to 
as ‘"covered" lines. An element of the matrix is 
said to be uncovered, once-covered, or twice- 
covered, accordingly as it lies in precisely none, 
one, or two covered lines. 

There is the column vector C,., each element of 
which corresponds to a row of the matrix. If 
the row is covered, the corresponding element 
of C,. = 1, otherwise it equals 0. Each element 
of the row vector R,. corresponds to a column of 
the matrix. If the column is covered, the cor- 
responding element of R<. = 1, otherwise it 
equals 0. 

The coordinates of uncovered zeros in the 
matrix are contained in so-called zero lists so 
that they may be quickly located without testing 
each element of the matrix. Zero list A contains 
the coordinates of all uncovered zeros and the 
coordinates of some zeros which may or may 
not be covered. Zero list B contains the coordi- 
nates of some zeros which are currently covered 
but which may at some future time become un- 
covered. Zero list A and zero list B are push 
down lists^ with a bottom pointer. That is, 
there is a cell pointing to the bottom cell of the 
list. In step 2, zero list B is emptied and added 
to the bottom of zero list A. This operation is 
facilitated by the bottom pointer in zero list A. 

Preliminaries, In the matrix [A], no lines 
are covered and no zeros are starred or primed. 
Consider row 1 of the matrix. Find the smallest 
element in row 1, and call it h. Subtract h from 
each element of row 1. In the process of sub- 
tracting h, some element (or elements) of row 
1 become zero. Whenever an element becomes 
zero and if there is no starred zero in the row 
and none in its column, star the zero and cover 
the column containing the zero. If there is no 
zero star in its column but there is one in its 
row, add the matrix coordinates of the zero to 
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the zero A list (unless the zero A list is already 
full) . Do the same for each row of the matrix.* 
If N columns are covered, the starred zeros 
form the desired result, and the problem is fin- 
ished. If less than N columns are covered, go 
to step 1. 

Step 1 

Take the coordinates of a zero from the zero 
A list, popping up the zero A list. If the zero is 
covered, push the coordinates of the zero into 
the zero B list and take another zero from the 
zero A list. Continue until a non-covered zero 
is found or until the zero A list is depleted. If 
the zero A list is depleted and flag F is set, 
search the matrix for an uncovered zero. If the 
zero A list is depleted and flag F is reset, go 
to step 3, 

Assume an uncovered zero is found. Prime 
the zero, then consider the row containing it. 
If there is no starred zero in this row, go at once 
to step 2. If there is a starred zero in the row, 
cover the row and uncover the column of the 
starred zero. Look for any uncovered zeros in 
this new uncovered column. If any exist, add 
their coordinates to the top of the zero A list 
(if the zero A list overflows, set flag F). Then 
go to beginning of step 1. 

Step 2 

There is a sequence of alternating starred and 
primed zeros, constructed as follows: Let Zq 
denote the uncovered 0' (there is only one). 
Let Zi denote the 0* in Zo's column (if any). 
Let Zi» denote the 0' in Z/s row (if Zi exists, its 
column is not covered since Zo is in its column 
and Zf, is not covered, so its row must be covered. 
Hence there is a 0' in this row (see step 1) . Let 
Z^ denote the 0* in Z 2 ^s column (if any). Simi- 
larly, continue until the sequence stops at a 0' 
which has no 0* in its column (Munkres proves 
such a 0' exists and the sequence is unique) ). 


columns are covered, the starred zeros form the 
desired result and the algorithm is flnished. 
Otherwise, empty the zero B list into the bottom 
of the zero A list and go to step 1. 

Step 3 

Let h denote the smallest uncovered element 
of the matrix ; it will be positive. Add h to each 
twice-covered element. Subtract h from each 
uncovered element. In the process of subtract- 
ing h, add the coordinates of any new zeros to 
the zero A list. Return to step 1 without alter- 
ing any stars, primes, or covered lines. 

For an example consider the board indicated 
in Figure 17. 

Also assume a set of six elements is given and 
connected as shown in Figure 18. 


12 3 



Figure 18. 


Now unstar each starred zero of the sequence, 
and star each primed zero of the sequence. 
Erase all primes (that is, remove the zero prime 
coordinates from vector C,,). Uncover every row 
and cover every column containing a 0*. If N 

* Note that at this time the zero A list (if sufficiently 
large) will contain the coordinates of every uncovered 
zero if any exist. If there are too many zeros for the 
zero list to hold, flag F will be set. Otherwise, flag F 
is reset. 



Figure 19. 
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Suppose element e^ is fixed in position pi and 
element e^ is fixed in position p-. Then the set 
of available positions is { p-, P:{, P4, Pr., P«;, Ps, 
Ps)}. Choose the unconnected set consisting of 
elements ei, e^, e.^, and e4. For this choice form 
the matrix A whose elements are F?/,, . The re- 
sults are given in Figure 19. 


For example, the computation of F”r> in- 
volves element e^ and position Pu (x = 2, y = 3) . 
Element e- belongs to four transductors and is 
connected to element e-. (at x = 1, y = 1) and 
element e<i (at x = 3, y = 1). Hence FJV, is 
computed as follows : 


^ I 1 — 2| + ! 1 — 3| ) + ( 13 — 2i + ! 1 — 3 I ) ] 

+ — 2| + i 1-31) + (13-2| + i 1-3 i ) ] 

+ -2:^[|3-21 + |1-3|] 


To solve the matrix, first do the preliminaries. 
The result is shown in Figure 20. 

Since N — 4 but only two columns are covered, 
proceed to Step 1. 

Step 1 examines zero 1, 3 in the zero A list, 
but finds that it is covered. Step 1 finding no 
uncovered zeros transfers to step 3. Step 3 
finds that 2 is the smallest uncovered element 
and proceeds to subtract 2 from each uncovered 
element and add 2 to each twice-covered ele- 
ment (there are none) . The result is shown in 
Figure 21. 

Step 3 transfers back to step 1. Step 1 finds 
an uncovered zero in 2, 6 and primes it. The 
row contains a 0* in column 3. Therefore, the 
row is covered, and column 3 is uncovered. The 
newly uncovered zeros in column 3 are added 
to zero list A. The result is given in Figure 22. 

The next uncovered zero that step 1 finds is 
4, 3. The zero is primed, and since the row 
contains no 0*, step 1 goes to step 2. 

Step 2 constructs the following sequence : 

0'i=4, 3;0* = 2,3;0' = 2, 6. 

Primes of the sequence are changed to stars 
and stars of the sequence are removed. At the 
end of step 2, the results are as shown in Fig- 
ure 23. 


Since N = 4 and only three columns are now 
covered, step 2 returns to step 1. The next un- 
covered zero that step 1 finds is 1, 4. The zero 
is primed. Its row has a 0* in column 1. There- 
fore, the row is covered and column 1 is uncov- 
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Figure 25. 


ered. Then the zero in column 1 is added to the 
zero A list. The result is shown in Figure 24. 

Step 1 then finds the uncovered zero 3, 1 and 
primes it. Since its row has no 0*, step 1 goes 
to step 2. Step 2 constructs the following se- 
quence : 

0'=i:3, 1;0* = 1, 1;0' = 1,4. 

At the end of step 2, the results are as shown 
in Figure 25. 

Now four rows are covered and the algorithm 
is done. The placement can be read directly 
from the column vector Cs or the asterisks in the 
matrix. Element ei is placed into position Pr>, 
e^ in Ps, e;^ in P 2 , and in P 4 . Graphically, the 
result is as shown in Figure 26. 

The result was obtained through the applica- 
tion of only one unconnected set. In practice, 
several unconnected sets would be used until the 
placement finally converged. 

In summary, the purpose of the various ta- 
bles and vectors is to increase the speed of com- 
putation. The purpose of the zero tables is to 
enable the computer to find quickly an uncov- 
ered zero, without searching the matrix for one. 
Once the coordinates of a zero (i, n) is obtained 
from the zero A table, it can quickly be tested 
to see if it is covered (i.e., check element i of 
vector Ce and element n of vector Re). Primes 
and stars are recorded only in the vectors men- 
tioned, not in the matrix. They are shown in 
the matrix in the example only to make the 
example easier to follow. Thus, to erase all 
primes involves merely setting vector to 
zero. To uncover a column and to cover a row 
involves the change of only 1 cell in vector Re 
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and 1 cell in vector C,.. Similarly, recording and 
changing of 0*’s is also a simple operation. 
Thus, the set up and solution of a matrix con- 
taining roughly 140 rows and 170 columns takes 
an average of less than 1 minute of time on the 
IBM 7090 computer. 

7. More Than One Element in the Same Posi- 
tion 

Suppose there is a unit called a can which 
contains more than one logic element. For ex- 
ample, a can might hold 4 flip-flops or 6 gates, 
etc. The technique for handling a placement for 
this siutation is considered below. 

First, place the elements randomly in the 
cans according to the rules (e.g., 6 gates to a 
can). Some cans may be only partly full. Then 
place the cans on the board in some random 
initial position. Then make a family of uncon- 
nected sets for each type of element when there 
is more than one element of that type in a can. 
For example, there may be a family of uncon- 
nected sets consisting of only gates of a certain 
type. 

Then, manipulate the families of unconnected 
sets, not using any available positions other 
than the positions originally occupied by those 
elements. The result will be that elements are 
assigned to the cans more optimally than they 
were in their original assignment. Now make 
unconnected sets out of the cans and move the 
cans (using the empty board positions). Thus 
there are two types of movement. One is move- 
ment of the elements within the cans (keeping 
the cans fixed) . The second is movement of the 
cans. The placement program would alternate 
between these two types of movement. 
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DISTRIBUTED MEMORY 
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Albuquerque, New Mexico 


INTRODUCTION 

Memory techniques which have so far been 
developed with computer technology all share 
limitations imposed by the requirement that a 
memory address be specified uniquely for stor- 
age or retrieval of data. In many problems 
these addresses are merely “dummy variables^’ 
with the only meaningful order being one de- 
rived from the content of the data points them- 
selves; examples of such operations are sorting, 
interpolating and catalog look-up. Largely be- 
cause of the astronomical proportions of the 
computing task assumed by some real (and rea- 
sonable) problems in these and other closely 
allied areas, there has been an increasing inter- 
est in the type of memory in which information 
is stored (and retrieved) on the basis of con- 
tent. Storage systems with this modus oper- 
andi are generically classified as “associative 
memories.''^-® 

There are two fundamentally different con- 
ceptions of an associative memory. The first 
can be regarded as exact (in the same manner 
that a conventional computer memory is exact) 
since a normal storage address is generated 
from an examination of the data to be stored. 

In this version, an item for entry is character- 
ized by as many descriptors as possible (or as 
may be limited by the dimension of the mem- 
ory) and the entry is stored “at the intersec- 
tion’’ of these descriptors. The list of descrip- 
tors, if regarded as separate from the data 
word itself , has been given the name associative 


criterion by Kiseda et al.^® Obviously, a single 
storage word may in general have a multiplicity 
of associative criteria, or alternatively, several 
words may have a common one. This interpre- 
tation of an associative memory is concerned 
with exact data storage and retrieval (albeit 
the retrieval request may be inexact and in- 
complete) and not with “interpolation’’ between 
stored data points. 

The second conception of an associative mem- 
ory is concerned almost exclusively with an 
ability to interpolate between stored data points. 
This does not imply that exact recall of stored 
data points is no longer desired, although this 
capability is generally compromised, but rather 
that the memory should have the ability to in- 
terpolate (or extrapolate) in a field of statis- 
tically related memory entries. Biological sys- 
tems evince both types of memory processes 
(hence their common classification under the 
heading of associative memories). Apparently 
the most important design feature of the bio- 
logical system is what is termed “distributed 
memory.” Unlike a computer memory where a 
particular item is stored in a sharply defined 
location the entries into a biological system are 
distributed through relatively extended vol- 
umes of the neural material, so that even partial 
extirpation cannot erase a strongly recorded 
memory trace.” The concept of distributed 
memory has received a great deal of attention in 
the past few years, primarily because of the 
pioneer work of F. Rosenblatt on percep- 
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and the more recent researches of 
B. Widrow and J. Angell on Madeline com- 

ASSOCIATIVE ADDRESSING 

If all of the data words ^hich are to be 
stored in an associative memory could be made 
available in parallel, then by an examination of 
the individual associative criteria (descriptor 
lists or independent variables) a decision could 
be reached as to which words were to be stored 
near each other. This decision procedure, based 
on some measure of nearness between the vari- 
ous associative criteria, is precisely the opera- 
tion which is carried out in computer sorting, 
for an example. The reason that such programs 
are so costly in time and machine operations is 
that the device must, by a scanning, or moving 
shuffle, effectively simulate the parallel occur- 
rence of the data words. If an associative ad- 
dressing scheme is to be effective, it must be 
able to assign each individual memory entry, as 
it appears serially, to precisely the memory lo- 
cation to which it would have been assigned had 
the entire memory fill been available in parallel. 
It is obvious that if the associative criteria are 
inadequate for the parallel classification, no 
serial procedure can do better. 

The associative addressing scheme which 
forms the basis of this paper is extremely sim- 
ple. In lieu of having all of the data words 
available in parallel so that their associative 
criteria can be compared simultaneously, the 
associative criterion of each new word is com- 
pared, as it appears for entry into the memory, 
with a large number of bogus associative cri- 
teria which form a fixed part of the memory. 
One of the most important questions which 
must be asked concerning any particular mem- 
ory is whether that specific selection of bogus 
associative criteria is complete in the sense that 
they will cause a data word to be addressed to 
the same storage location as would have been 
done by the simultaneous comparison of all data 
words. This question is treated in detail in Ap- 
pendix 1; however, the following argument 
shows that there will always be some selection 
for which this is true. 

The input space over which the data words 
are defined will always be a finite point set as a 


consequence of a restriction on each of the de- 
scriptors or independent variables that they lie 
in a finite discrete range. Therefore, the com- 
plete memory fill, which is a subset of functions 
on this point set, is itself finite. It was assumed 
earlier that the associative criteria were ade- 
quate for the unambiguous parallel associative 
addressing of the data words. If now the selec- 
tion of fixed associative criteria included every 
word of the memory fill, then obviously by this 
assumption each data word would be properly 
addressed. Equally obviously, if the memory 
required this number of bogus criteria to oper- 
ate it would be of no practical interest. The 
foregoing argument was introduced only to 
demonstrate that for some finite selection of 
fixed criteria the data word could be assigned 
the proper associative address. 

Consider a function f (xi,X2, . . . , x„) defined 
over n variables, where each of the variables is 
restricted to some finite discrete set of values, 
as specifying the environment or source of en- 
tries for an associative memory. Then the set 
of values 

{f(xi,X2, .. .,x,») ;x,,X2, ...x„} 

consisting of the complete specification of a 
point in the input space and the value assumed 
by the function at that point is a data word. 
The second half of the bracketed expression 
{. . . ; Xi, X2, . . . , x„} 

in which the Xi elements are treated as the inde- 
pendent variables of the input space is the asso- 
ciative criterion. 

Each of the Xi above is restricted to a discrete 
set of values. Let kj be the number of incre- 
ments in the range of x*. It is an extremely 
useful convention for the purposes of associa- 
tive addressing if we think of the points in the 
input space as represented by a linear array of 
symbols 

(Xii, X,2 . . . (X21, X21> . . . X2k, ) . . . 

(x„i, x„2 . . . x„kj (1) 

where each of the parentheses represents all of 
the values in the discrete range of the corre- 
sponding variable. Every point in the input 
space will be represented as a vector of this 
form in which a single 1 appears in each pa- 
renthesis, with all other entries in that paren- 
thesis being 0. Each of these vectors has N 
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binary elements, and hence is a vertex of the 
N-dimensional unit side hypercube situated in 
the vertex of the first octant in Ex space, where 

n 

N = ^ k, (2) 

i = 1 

In the balance of this paper we shall refer to 
this as the input or S space. 

Consider two points in S space represented by 
their liijear vectors (in the same form as (1) ) ; 
X, X'. Ignoring for the moment the value as- 
sumed by the function at these jJoints, the first 
question, and certainly the most natural one, is 
“How close are X and X' to each other?'' In 
order to discuss nearness it is necessary to in- 
troduce a metric. The most natural metric for 
the points in S space as defined by ( 1 ) is 

d-'^n — X' • X^^’ (3) 

which has the simple interpretation of testing 
whether the variables are identical in each par- 
tition, and of giving as the metric the number 
of partitions in which X and X' differ. The 
parallel associative classification of a memory 
fill is based on the computation of all such dis- 
tances between entries. 

The points in the original input space, n- 
dimensional, were mapped onto a select set of 
the vertices of the unit hypercube in Ex with the 
restriction that the Hamming weight of these 
points be precisely n. Similarly the bogus asso- 
ciative criteria, or reference points in S space, 
will also be chosen to be vertices of this same 
hypercube, without, however, the restriction to 
Hamming weights of n. It is most convenient 
to consider these vertices to have been chosen 
randomly so that the reference points are uni- 
formly distributed on the hypercube. This defi- 
nition gives rise to a difficulty which did not 
exist in the computation of the distance between 
a pair of associative criteria. There, since the 
vectors, X, were restricted to have a single 1 
in each partition, the distance was either 1, 
within a single partition, or 0 depending on 
whether the value of that variable was the same 
in the two associative criteria, or dissimilar, 
respectively. Now there is the possibility that a 
single partition could contain all I's, for an 
example, in which case the distance d would 


be 0 in that partition irrespective of which asso- 
ciative criterion it was compared to. This weak- 
ness of the metric d in the enlarged space, i.e., 
the union of the permissible input S-space and 
the space made up of the reference points, does 
not prevent the model from working, as the 
analysis of Appendix 1 proves, but it does indi- 
cate that only a very small portion of the total 
information available is utilized in forming the 
associative address. We will still define dis- 
tance in this enlarged space by the metric d 
and ignore the degeneracies which can occur. 

The vertices of the hypercube chosen as ref- 
erence points are most conveniently treated as 
vectors since d- is formed by an inner product, 
and hence we will refer to any individual refer- 
ence point in the extended S space as an asso- 
ciation vector Aj. If there are R association 
vectors, then they may be best dealt with as an 
N x R binary association matrix A. The prod- 
uct of an associative criterion, considered as an 
N element row vector, and A, is a row vector 
with the R elements being the metric d- less n 
in each case. This vector is designated as the S 
vector with elements Sj. 

S=X*A (4) 

In Appendix 1 it is shown that for R sufficiently 
large it is possible to recover d- from S and S', 
and that therefore the metric is complete over 
the point set of R-tuples S. 

We will now introduce a nonlinear threshold 
operation on the elements of the vector S. A 
new (1 X R) row vector a, with elements a?, will 
be derived from the S vector by the following 
procedure : 

_ ri if Si ^ 8 
to if Sl < 8 

Thus we have generated from the product X • A 
a binary (1 x R) row vector which has a 1 in 
the i-th position if the input X is not more than 
n — 8 distant from the reference point A ,. Again 
it is worthwhile emphasizing that for R suffi- 
ciently large the vectors a and a specifying the 
distance of X and X' from the R association 
vectors Ai, A^, . . . Ak, are at the same time ade- 
quate to define the distance d- between X and 
X'. This is true in spite of the weak definition 
of the metric in the extended space as is shown 
in Appendix 1. This a vector is the desired asso- 
ciative address. 



496 


PROCEEDINGS—SPRING JQINT COMPUTER CONFERENCE, 1964 


It is possible to give a simple geometrical in- 
terpretation of the associative address. As was 
noted above the extended S space is the set of 
vertices of the unit hypercube in Ex. A hyper- 
sphere of radius Vf be circumscribed about 
every such hypercube and a central projection 
of the hypercube onto the sphere can be made 
which leaves the vertices unchanged. The most 
important point for our geometric discussion 
though is that the circle drawn through all ver- 
tices of the same Hamming weight of the 
hypercube on the surface of the hypersphere 
will be equidistant from the poles of the sphere 
at (00 ... 0) and (11 ... 1). Obviously every 
point which can exist in the extended S-space 
lies on this hypersphere. We can now map 
these hyperspheres on an ordinary sphere in 
3-space in which case there will be N — 1 circles 
around the sphere corresponding to Hamming 
weights of N — 1, N — 2, ... 2, 1. Thus the simple 
projection of the 3-variable hypercube on the 
sphere yields a representation as shown in 
Figure 1. Similarly for higher dimensional pro- 
jections we get N — 1 circles, each with 
points on it of Hamming weight, w. 



The points in S space which can be X vectors 
as given by (1) all have a Hamming weight of 
n ; however, not every point with a weight of n 
is a permissible X-vector. There are Nh points 
with a Hamming weight n and only T permis- 
sible X vectors. 


N 


II — 


f n 


z 


k. 


i = 1 


n 

T= ][ k, 

1 = 1 

In general disregarding the points of S space 
which have a Hamming weight different from 
n, we have Figure 2 as the projection of the 
hypercube on the sphere, where the symbol o 
represents permissible X vectors and the sym- 
bol • represents other points which have a 
Hamming weight of n. 

Obviously the distance, d, from any such 
point, X, to a set of vertices of the hypercube, 
Aj, could be computed. This is precisely the 


(II I) 



Figure 1. 


(II • • • 1) 



Figure 2. 


S-vector discussed above, i.e., Figure 3. The 
nonlinear thresholding operation consisted of 
testing each such distance against a reference 
S to see if it was greater than or equal to the 
reference. 

_ r 1 if S, ^ 8 

- 10 if Si < 8 

Recalling the way in which s was formed, i.e., 
Si = X • Ai, we see that s is largest for the 
closest reference points. Thus the threshold 
operation can be geometrically represented by 
constructing about X as a center a hypersphere 
of radius 8 and determining which of the refer- 
ence points lie inside or on this sphere. The 
corresponding to these points will be assigned 
a value of 1 by the threshold test. The intersec- 
tion of the hypersphere about X with the hyper- 
sphere on which X occurs is a hypersphere of 
dimension (n — 1) as is shown in the following 
projection (Figure 4) on a sphere in 3-space. 



APPLICATION OF AN ASSOCIATIVELY ADDRESSED, DISTRIBUTED MEMORY 497 


(II • ■ ■ I) 



(00 - 0 ) 


Figure 3. 


(I I • • • t) 



Figure 4. 

This completes the geometrical description of 
the mapping from the N dimensional extended 
S space to the R dimensional memory space. 
The resulting set of binary valued a metrics is 
the associative address. 

DISTRIBUTED STORAGE 

In the foregoing analysis of a possible asso- 
ciative addressing technique we specifically re- 
tricted attention to that fraction of the data 
word which specified the location of the data 
point in S-space. This base vector, X, was 
shown to preserve its metric relationship to 
other base vectors under a linear transforma- 
tion followed by a threshold operation which 
transformed X into a new vector, a. It is this a 
which we propose to use as an associative ad- 
dress in storing or retrieving data from an 
associative memory. The most important quali- 


tative point in this procedure is that the trans- 
formation maps points from the N-dimensional 
S-space into points in an R-dimensional space. 
This tremendous increase in the number of 
points in space is the basis for the storage of 
the balance of the data word. 

In general, if we consider an R-dimensional 
space we can position an (R — 1) dimensional 
hyperplane through this space so that the per- 
pendicular distances to R arbitrary points 
(which are assumed to constitute a basis for 
the space) from the plane may be arbitrarily 
specified. The restriction that the points must 
constitute a basis is equivalent to requiring 
linear independence of the base vectors from 
the origin of the system to the points. It is this 
latter statement which is the essential restric- 
tion for the weaker case in which fewer than R 
points are being fitted to a hyperplane in R- 
dimensional space. 

Let the equation of the hyperplane be given 
in the form 

XQ = d 

where the X and Q are nonzero, R-tuples. Then 
the equivalent normal form is^® 

XL = p 

where 



are the cosines of the direction angles of the 
hyperplane and p is the length of the normal 
from the origin of the R-space to the hyper- 
plane. This normal, which is of course unique, 
may be most easily expressed as an R-tuple, 
with one free parameter, p. 

N = ^L 

The /i, are now the direction cosines of the nor- 
mal. With the notions just developed it is now 
an easy matter to express the distance di of an 
arbitrary point a,n to the hyperplane. 

di p — a,^^ * L 

But this is a linear system, in general non- 
homogeneous, of the form 

a • L G 

where G is the R-tuple of elements (p — di). 
Since a solution of such a system for arbitrary 
G is possible if and only if the elements of the 
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linear system a • L are linearly independent, 
we arrive at the earlier restriction that the 
vectors to the points, «(,.», must be linearly in- 
dependent and therefore, in the case in which 
there are R-points, must constitute a basis for 
the space. 

As was shown earlier in the description of 
the formation of S-space and of the procedure 
by which points in it are mapped into points in 
M-space, these image points are restricted to 
be vertices of the R-dimensional binary-valued 
hypercube positioned in the vertex of the first 
octant of the space. Actually the restrictions 
were much tighter than this ; however, it 
suffices to consider any of the vertices of the 
hypercube in the following development. First 
we have just shown that a hyperplane can be 
found such that it lies at a specified, but arbi- 
trary, distance from each of R linearly inde- 
pendent points in R-space, as shown in the fol- 
lowing example in 3-space (Figure 5). 

Instead of considering a hyperplane at an 
arbitrary distance p from the origin, which lies 
at arbitrary but specified distances from the 
let us consider a hyperplane through the 
origin which lies at arbitrary relative distances 
from the In other words find Q such that 

Q?(i> • Q _ ^ 

j) • Q dj 

Recalling that the are vertices of the R- 
dimensional hypercube, we see that this is satis- 
fied by letting 


which is the quantity we propose to use in stor- 
ing f(X) ; i.e., let 

f(Xo)) =di IIQII ==an) • Q (5) 

We shall hereafter refer to the R-tuple Q as the 
hyperplane with the understanding that what 
is actually meant is that Q is the coefficient 
array of 

XQ = 0 

which is a linear equation in R unknowns and 
hence is the equation of the hyperplane. 

It is now possible to construct, in the instance 
where R linearly independent points are being 
fitted, a single plane through the origin with 
the required relative distances, say gi. Figure 6 



Figure 6. 

shows this for the example used earlier from 
3-space. 

If fewer than R-points are to be “stored’' 
using this technique, then there are correspond- 
ingly infinitely many solutions for each free 
parameter so introduced. The essential point 
though is expressed by equation (5), namely 
that the functicn to be stored at a point a^],) is 
given by the no aal distance to the hyperplane 
Q times the norm of Q, i.e., aru • Q. 

The distributed memory is achieved in the 
R-tuple representation of the solution hyper- 
plane since each element qi will in general be 
involved in the computation of the distance to 
many image points, and any individual distance 
is dependent on many q:. 
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THE ALGORITHM FOR THE SOLUTION OF 
THE ASSOCIATED LINEAR SYSTEMS 

Each data word as it is entered into the 
memory generates an element of a linear sys- 
tem. This system of T equations in R unknowns 
could, if it were available at one time, be treated 
by several conventional methods. Unfortunately 
the data points are presented serially so that 
only a single equation is available to the asso- 
ciative memory at any particular time, and we 
have presupposed no detailed memory for the 
elements of the linear system. Thus, any solu- 
tion will have to be generated by a sequential 
operation on each equation. There is only one 
technique in use for solving linear systems 
which has this essential property, i.e., sequential 
manipulation of one equation at a time. This is 
the ‘‘relaxation method’’ devised by Gauss and 
revived and popularized by R. V. Southwell and 
his colleagues.^^’-^^ Basically, this procedure de- 
pends on the successive minimization of a set of 
“residuals” associated with the family of linear 
equations being solved. Let the linear system 
be of the form 

n 

V ayQ, = f(Xo)) (6) 

i=:l 

then the residuals, Rj, are defined to be 
n 

f(Xa,)— a«Q,' = R, (7) 

i = 1 

where the primed values of Qi indicate some 
estimate of the actual vector Q. The most com- 
monly employed algorithm used in relaxation 
of the family given by (6) depends on comput- 
ing all of the residuals, Ri, R 2 . . . Rt, for some 
assumed value of Q. The largest of these resid- 
uals is then chosen and a change of the ele- 
ments in the assumed Q is made so that this 
residual is caused to become zero (or near zero) 
and a new family of residuals is computed for 
this new value of Q'. This procedure is re- 
peated until all residuals are within a tolerable 
bound of zero, at which time the value of Q' is 
considered to be the approximate solution for Q. 
The above description of the relaxation method 
is based on the most commonly used algorithm, 
but not the only one used ! Other techniques de- 
pend on choosing the residual which requires 


the greatest change in Q' for its liquidation, 
over-relaxing so that the algebraic sign of the 
residual being liquidated is changed, and under- 
relaxing, in which only a part of the residual is 
liquidated. We do not intend to discuss relax- 
ation methods, as such, and the foregoing was 
intended only to show the relationship of the 
algorithm used in the present model to an exist- 
ing procedure for the solution of a system of 
linear equations. 

In each of the algorithms mentioned above 
for use with relaxation techniques, the entire 
system of linear equations was assumed to be 
available at one time so that all of the residuals 
could be computed and a decision made, based 
on the values of these residuals, as to the next 
step in the process. In the model of the asso- 
ciative memory under discussion only one 
equation is available at a time, so that its resid- 
ual can be either partially or totally liquidated, 
without, however, knowing the effect of this 
operation on the other residuals. 

From equation (7) we may write the linear 
system, in residual form, as 

f(X,,)) — ao) • = (8-a) 

TTTVj/>»*/:k 'V’,. -ig -J +V» ■*-krviTrt+ rtV* rvoiDi-ri in 

VVll'CXC AO (/AlC: J-l/AA ^V/AAAlr ^AAV^OVJ^AA AAA KJ— 

(not a component of X) and a^j) is the image of 
X(j) under A in M-space (again not a compo- 
nent of a). If Rj is to be liquidated, and if no 
information is available concerning the relative 
value of a variable (i.e., the effect of a particu- 
lar variable on all other residuals) which is 
certainly true in the restricted system under 
consideration here, and if further all of the co- 
efficients are identical (1 in this ease), then 
the readjustment of Q' to a new value may be 
best accomplished by dividing the correction 
equally among all components. What we have 
said is that we have no information on which 
to base a selective adjustment of the variables 
in Q', and hence we will impartially adjust all 
of them by the same amount. The computa- 
tional algorithm takes the form : 



Q" = Q' + 

where the primes indicate successive approxi- 
mations of Q and 0 < k < 1. k is most often 
chosen to be 1 so that the residual at each stage 
is totally liquidated. 
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With the new estimate for the vector Q ob- 
tained from the iterative cycle (8-a,b,c), a new 
residual will be computed using the next set of 
coeflScients, aj, when they are generated. Ob- 
viously the second serious question concerning 
this particular model of an associative memory 
is, ‘'Under what conditions does the iterative 
procedure converge?” The convergence is 
analyzed in detail in Appendix 2; however, it 
suffices for the present purposes to summarize 
the results obtained there as saying that if the 
system has a solution, then the procedure will 
converge to it, and that if the system has no 
solution it converges to a “best” approximation 
in a sense which is discussed in Appendix 2. 

This completes the description of the asso- 
ciative memory model. We will now discuss the 
behavior of such a memory simulated on a 
CDC-1604 computer. 

EXPERIMENTAL INVESTIGATIONS 

The model for an associative memory devel- 
oped in the foregoing analysis leaves two prin- 
cipal questions to be answered by experimenta- 
tion. The first concerns the behavior of the 
iterative procedure when used to solve linear 
systems; the second concerns the behavior of 
the memory, i.e., its convergence during the fill 
operation and the accuracy of its estimation of 
previously unfilled points in S-space. 

In the first experiments no associative 
memory will be used; rather a linear system 
such as might be produced by an associative 
memory will be contrived to simulate the a vec- 
tors and demonstrate the solution behavior of 
the algorithm. 


digits^ each block of eight digits corresponding 
to a single linear element, and the appropriate 
combination was substituted for the decimal 
code digits to yield the system of forty random 
binary coefficients and forty linear elements. 
The first five elements are given in octal form 
below to indicate the general character of the 
linear system. 

15121441611320 

06454142431304 

12443512300720 

05105454214510 

11051505113014 

The statistically predicted overlap for such 
a system is 16%. This compares very well with 
the actual overlap of 15.8%. We thus have a 
system with a precisely known Hamming 
weight, 16 or a density of 40%, and a nearly 
statistically perfect overlap. One point still has 
to be insured — the compatibility of the entire 

A 

linear system. To insure this we assumed a Q 
and computed the functional values to be 

A 

a(j,. Q, thus guaranteeing the linear system to 
be a compatible one, even if the a's failed to be 

linearly independent (as they did). The Q was 
chosen to be 

AAA A 

Q = (qi, Qi*. • . • , q^..) 

where 

Qi = i ; i = 1, 2, ... 40 

The function f computed in this manner has a 
mean of 

f = p = 328 


Experiment No, 1 

In this experiment a set of forty linear equa- 
tions (with binary coefficients) in forty un- 
knowns, and a half set of these equations were 
solved. It was desirable that these equations 
all have precisely the same Hamming weight 
to make them more nearly like the a vectors 
generated by the associative memory. To ac- 
complish this the following scheme was em- 
ployed. Each of the possible distinct combina- 
tions of two I's and three O's were assigned a 

code number: 0, 1 9. A random number 

generator was used to generate 320 random 


and a standard deviation of 

a = Vnpd — p) = 63.5 

This system of forty compatible equations in 
forty unknowns is the basis for the first portion 
of this experiment. 


The following indicator of convergence was 
defined : 



I 1.1 1 

, A 

' Q| 


defined over a cycle. E is in some sense a mean 
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Figure 7. Convergence of a 40 x 40 Linear System. 

absolute percentage of error in estimating 

Q, and consequently is a useful in- 
dicator of convergence. It is important that f 
be well removed from 0, and that the cr be small 
enough to not produce large fluctuations in the 
percentages. The generating scheme described 
above was chosen to satisfy all of these require- 
ments. 

The convergence for the 40 x 40 system is 
shown in Figure 7. This plot also shows one of 
the empirically discovered features of this 
algorithm ; namely, that a plot of the logarithm 
of the mean absolute percentage error versus 
the logarithm of the cycle number is approxi- 
mately a straight line. This is qualitatively true 
of all the systems, which have been solved to 
date. 

The second portion of this experiment con- 
sisted of solving the first twenty elements of the 
preceding linear system, i.e,, a linear system of 
twenty equations in forty unknowns. E is 
plotted in Figure 8. 



Figure 8. Convergence of a 20 x 40 Linear System. 


Experiment No, 2 

The objectives of this experiment are basically 
the same as those which prompted experiment 
No. 1 ; however, there are two differences : 

1. The linear system has been increased from 
40 X 40 to 100 X 100. 

2. The coefficients have been directly as- 
signed by the random number generator 
so that the Hamming weight is only 
statistically defined. 

The a elements are generated by a random 
number generator, with the probability of hav- 
ing a 1 in any single position being 0.2. The 
mean Hamming weight of the a vectors there- 
fore is 

Ha = pn = 20 

with a standard deviation of 

= Vh„( 1 — p) = 4. 

After the a elements were generated by the 
above scheme, the functional values were as- 
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Figure 9. Convergence of a 100 x 100 Linear System. 

signed using the same procedure described in 
Experiment No. 1. The mean functional value 
then is given by 

? = » p = 1010 

iU 

’with a standard deviation of 

8 — — V np ( 1 — pi = 202 

In this experiment only the cyclical sequence 
for relaxation was used. Figure 9 shows a plot 
of the convergence of the system of one hundred 
equations in one hundred unknowns as a func- 
tion of the number of cycles through which the 
system has been relaxed. 

The linear system investigated in this experi- 
ment corresponds more closely to the systems 
which will be generated by the probabilistic 
associative memory, whereas the system of 
Experiment No. 1 corresponds more closely to 
those generated by a complete binary matrix. 
In the latter case, the Hamming weight of all a 


elements is exactly the same, while in the proba- 
bilistic case the Hamming weight is only statis- 
tically specified. One would expect the behavior 
exhibited in this case to extrapolate directly to 
larger linear systems and thence to the linear 
systems generated by the associative memory. 
The next experiment shows this indeed to be the 
case. 

Experiment No, S 

This last experiment (to be described in this 
paper) on linear systems differs from the pre- 
ceding experiment only in the size of the linear 
system considered — 250 equations in 1000 un- 
knowns. The generating procedure for the a 
elements and the method of assigning the func- 
tional values to insure compatibility are the 
same as those already discussed. The essential 
parameters descriptive of this problem then are : 


size 

250 X 1000 

P 

0.2 

H, 

200 


12.65 

f 

100,100 

(T 

6,331 


The convergence behavior of this linear sys- 
tem, when solved using cyclical relaxation, is 
shown in Figure 10. 

The test problems investigated in this series 
of three experiments were selected to show the 
essential properties of solutions generated by 
nonselective relaxation, i.e., relaxation which is 
nonselective in the sense that the set of resid- 
uals is not examined to determine which linear 
element is to be relaxed, and also in the sense 
that the coefficients are not considered in select- 
ing an element (or elements) to be relaxed. In 
addition to illustrating these general behavioral 
characteristics, the size of the systems being 
solved was gradually increased until they were 
comparable in magnitude to those generated by 
interesting associative memories. With these 
comments we leave the subject of the nonselec- 
tive relaxation technique and consider instead 
actual associative memory studies. 

Experiment No, k 

This experiment is concerned with an appli- 
cation of the associative memory model to a 
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Figure 10. Convergence of a 250 x 1000 Linear System. 


problem in pattern recognition, which while 
simple, is certainly not trivial. The experiment 
will be described in two stages : first the pro- 
cedure by which the patterns are generated will 
be described and analyzed, and then the asso- 
ciative memory which was used and its func- 
tioning will be discussed. 

It is desirable in order to simplify the anal- 
ysis of the results that the pattern generation 
scheme assign approximately half of the possi- 
ble inputs to the class of patterns and half to 
the class of nonpatterns. It is also desirable that 
the procedure have an intuitive appeal as a pat- 
tern generator, that is that the patterns should 
be recognizable by some relatively simple logical 
test. The following satisfies both of these cri- 
teria. Consider a four-element mosaic Xn, Xi^, 
X 21 , X 22 , where each element can assume any one 
of a discrete range of values, x. 

The pattern which is tested for in this experi- 
ment consists of a bar, either vertical or hori- 


zontal, where both the bar and the background 
are subject to noise. This is accomplished by the 
following logical test: 

f (X) = b, e h. 


where 



if min (x„, x,^) > max (X 21 , X 22 ) 
or min (x^i, X 22 ) > max (xn, X 12 ) 
otherwise 

if min (xn, Xi^) > max (Xi 2 , X 22 ) 
or min (xi^, X 22 ) > max (xn, X 21 ) 
otherwise 


This scheme has the intuitive appeal, if one 
interprets the range of possible values of x as a 
grey scale, of detecting a noisy pattern on a 
noisy background. The general rule for the 
recognition of a figure is that 

min (F) > max (B) 


where F designates the figure elements and B 
designates the background elements, i.e., the 
lightest element in the figure is to be darker 
than the darkest element in the background. 
Thus a contrast enhancing procedure which con- 
verted all elements of a field to black or white 
depending on whether they were greater than 
or equal to the minimum value in a potential 
figure or less respectively would reconstruct a 
black figure on a white background with no deg- 
radation, had the input mosaic pattern actually 
been in the class of figures. The probability that 
an arbitrary four-element pattern, with a grey 
scale of 48 steps, is recognizable as a bar is 
0.3197, which provides an adequate density of 
patterns for the purposes of the present ex- 
periment. 


The associative addressing matrix used in this 
experiment had an effective size of 9600 col- 
umns, each of which was 192 bits in length. The 
actual matrix was 1/48-th this size, 192 x 200, 
but a technique devised by Dr. H. Everett of 
WSEG makes it possible to use this smaller 
matrix in such a way as to be equivalent to the 
larger memory. The matrix is filled with ones 
and zeroes using a random number generator, 
with the probability of a one being entered in 
any bit position being a prechosen value p : in 
this example p = 0.2, 




504 


PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 


The threshold was selected to be 8 = 3. Using 
these parameters it is possible to define the sta- 
tistical behavior of the memory in generating 
the associative addresses. The probability that 
any particular a element is a one, P , is given by 
4 

P = ^ ^ ) (0.2)* (0.8)^-* = 0.0256 

i = 3 

Consequently the mean Hamming weight of 
the a vectors is 

H, = 245.76 

with a standard deviation of 
.r,, = 15.48. 

This says that the number of a elements which 


will be operated on in any single relaxation 
cycle is between 215 and 277 with a 95% likeli- 
hood, which is a satisfactory description of the 
extent of the alteration in Q introduced by re- 
laxation. 

The associative memory was shown a se- 
quence of 400 patterns constructed by using a 
random number generator to assign values to 
the Xy. Whether a particular mosaic input was 
a pattern or not was determined by applying 
the logical test. It is worth noting that these 
400 points were drawn from an environment 
of 5, 308, 416, (48^), possible mosaic configura- 
tions. The scores of the device are given in 
Table I, as accumulated over cycles of fifty 
exposures. 


TABLE I 

Number of “patterns” Pattern 
Cycle in the cycle Errors 


Nonpattern 

Errors 


1 30 

2 34 

3 25 

4 31 

5 33 

6 29 

7 26 

8 26 

Totals 234 


27 

10 

4 

8 

2 

3 

4 
7 

65 


2 

12 

16 

6 

11 

10 

8 

4 

69 


The results tabulated in the preceding para- 
graph summarize quite well the operation of the 
associative memory as a pattern recognizer. 
The fact that roughly the same number of pat- 
terns and nonpatterns have been misclassified, 
out of a population of 234 patterns and 166 
nonpatterns, is an indication of unbiased per- 
formance in the classification of mosaic figures. 
These results are shown graphically in Figure 
11 where the raw scores, accumulated over 
cycles of fifty exposures, and a smoothed curve 
are plotted versus cycle number. 

Experiment No. 5 

This experiment is devoted to an investiga- 
tion of the behavior of the associative memory 
when used to store a simple algebraic function 


originally used by Smith-’ in his studies of a 
perceptron-like computer model, ADAP II. 
This particular function was chosen because it 
embodied several special features whi;h allow 
a testing of the associative memory functions 
and because it permits a qualitative comparison 
of Smith’s results and those achieved using the 
associative memory. The “Smith function” is 
given by 

F„ = X, Xm -f 2xn -|- 0 • X4 
which has a mean of 



where kj is the number of nonzero steps in the 
discrete range of the i-th variable. For test pur- 
poses this function has several advantages. It 
involves a cross product term which has proven 
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ERRORS X lO"' 



Figure 11. Scores of the Pattern Recognition Program 
on Cycles of 50 Points Each. 

to be a very difficult function for perceptron- 
like (single layer perceptrons) devices to learn. 
It has a term which has a zero coefficient and 
hence, although this variable enters into the 
associative addressing, cannot affect the func- 
tional value. Finally the basic form is a linear 
summation of terms, a functional form most 
easily stored in the quasi-linear associative 
memory. In all of the tests which make up this 
experiment n = 9, i.e., Xj = (0, 1, . . . , 9). The 
matrix is 192 x 9,600 with the Everett scheme 
for permuting a base matrix being applied to a 
basic 192 x 200 array. 

The essential parameters of the experiment 
are 

p = 0.2 

S =r. 3 

which gives a mean Hamming weight of 
= 261.12 

A random number generator was used to as- 
sign random values (0 — 9) to Xi, X2, X3, and X4 
and the corresponding Fs was calculated using 
these random variables. The convergence plot 
for this test is shown in Figure 12. E is defined 
in a slightly different manner here than before. 
In the case of the contrived compatible linear 
systems where E was first introduced, the func- 
tional values were tightly clustered about the 
mean and bounded away from zero so that no 



Figure 12. Convergence for Fj^ with p — 0.2; S — 3. 


difficulty could be encountered in defining E to 
be the average absolute percentage error. Here 
the function is very broadly spread, and further- 
more not bounded away from zero; therefore 
the following related definition for E is used in 
discussing this experiment. 

D 


J_/ 1 : ^ 

i = 1 

where D = 60 in this experiment. 

The following test is actually the crux of this 
whole series of experiments since it demon- 
strates the actual operation of the associative 
memory. This test is concerned with the mem- 
ory’s performance on the F, function using the 
parameters given in the preceding paragraph. 
The convergence plot for this particular test 
has already been given in Figure 12 ; however, 
the detailed behavior of the memory is lost in a 
smoothed measure of convergence such as E. 
Since the primary function of the associative 
memory is the recall (or estimation) of func- 
tional values it is highly desirable to present the 
memory output in a form where its perform- 
ance can be conveniently judged. 

Two questions which occur naturally when 
one considers an associative memory are : 

1. How will it perform in a new field of data 
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n 


Figure 13. Test of a Filled Memory on Random Points 
in X Space (□ = predicted, • = actual values). 

points, if forced to estimate without new 
data being entered? 

2. What is the span of its memory; that is 
how well will it do if shown the same set 
of data points with which the memory was 
originally filled, compared to a new field 
of data points? 

Both of these questions have been investigated. 
The memory was “filled” by drawing 3000 data 
points at random as described before, and then 
by setting the relaxation coefficient to zero, 
k = 0, further changes in the Q vector were pre- 
vented. First a sequence of random points, pre- 
sumably new ones, were used to query the 
memory. The results of this test are shown in 
Figure 13. E = 0.10444 which indicates that 
the terminal performance of the associative 
memory while in the learn cycle, E = 0.10248, is 
probably a measure of its storage accuracy over 
the entire functional space. The second question 
actually answers this by showing that the de- 
tailed memory of the actual data points entered 
in the memory has been sacrificed for a type of 
distributed functional memory for the entire 
functional space. Figure 14 shows this phe- 
nomenon in a very striking fashion. The points 



Figure 14. Test of the Survival of Detailed Memory 
Entries (□ = predicted, • = actual values). 

immediately preceding the end of the storage 
period are all queried, with the unexpected re- 
sult that the memory has a detailed retentivity 
period of perhaps 500 points, or half a cycle, 
and that beyond this the individual data points 
have been lost individually and have been 
assimilated instead in a collective form as a 
“smoothed’' function. 

In conclusion then, the series of tests which 
made up Experiment No. 5 has demonstrated 
the behavior of the associative memory on an 
actual notrivial problem. The data could just 
as well have been drawn from any number of 
real-life problem situations ; however, it seemed 
appropriate to discuss a well-behaved analytic 
function in this paper to permit the reader to 
readily judge the performance of the memory. 

CONCLUSION 

In view of the length of some of the argu- 
ments which have been presented, it seems 
especially desirable to give a succinct descrip- 
tion of the mathematical model for an associa- 
tive memory developed in this paper. 

A function (commonly a decision function) 
f(X) is defined over an n-dimensional input 
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space, in which each of the variables is re- 
stricted to assume one of a discrete set, ki in 
number, of values. Each of these input points 
is mapped linearly onto a vertex of an N- 
dimensional unit hypercube, 

N= i K 

i = 1 

all of whose vertices make up the so-called S- 
space. The function f (X) is the desired mem- 
ory fill. 

A metric Si is defined by 

S = X ' A 

where A is an (N x R) matrix whose columns 
are points in S-space. These points, Ai, consti- 
tute a set of R fixed reference points in S-space. 
The metric Si defines the distance of X from Aj, 
and was shown to be complete. A much weaker 
metric, 0 : 1 , is defined on S by the logical opera- 
tion 

_ri if Sj. ^ 8 

~ 10 if Si ^ 8 
where 8 is a threshold value ; 0 ^ 8 ^ n. 

It was shown that for some A (i.e., for a suffi- 
ciently large number of fixed reference points) 
there exists at least one hyperplane, Q, in the M 
space in which a is defined, such that the dis- 
tance at which the point a, whose pre-image as 
defined above is X, lies from Q is the function 
f(X) multiplied by a suitable normalizing 
constant. Such a hyperplane may be found by 
the iterative procedure 


the distance d between words is computed and 
the entries are organized accordingly, d- was 
defined as a function of the N-tuples represent- 
ing the points in S-space and one must show 
that the same distance can be calculated for 
any of the proposed image spaces. In this ap- 
pendix we shall prove that d- is the same for X 
and X', S and S' and a. and a'. 

Following our earlier usage the vectors in 
S-space will be assumed to have N elements 
grouped in n partitions or variable ranges in 
each of which a single one occurs. Now let A 
be the complete N x 2^ binary matrix in which 
every possible N bit binary word appears as a 
column of the matrix. The S matrix is then a 
(1 X 2'^’) row matrix formed by multiplying A 
by X. If we consider two vectors, say X and X', 
in S-space the distance between them is found 
from (3) to be 

d2 = n — X'X'*' (3) 

The first step in demonstrating that the weak 
metric on S is a complete one is to show that 
d^ as given by (3) may be computed from S 
and S' and non-specific knowledge about 
S-space : 

XA = S and X'A = S' 

form 

X'A(XA)T = S'Sf 
or 

X'AA^X'A^ ^ g'S^r (9) 

The unusual properties of the complete binary 
matrix make it possible to greatly simplify the 
left hand side of (9) through a manipulation 
of AA’^ 



( 8 ) 


AA'T 



- 2^-2 
^ 2^-1 


if 

if 


i ^ j 
i = j 


irrespective of any ordering of the Aj. in A. Thus 
AA'J’ can be decomposed to the sum of two par- 
ticularly simple matrices, i.e., 


which was proven in Appendix 2 to converge. 

Thus we may finally conclude that the out- 
put of this particular model of an associative 
memory (at a time where Q' is the estimate of 
Q) upon being interrogated with the address 
(associative criterion) X is 

F(X) ^a(X) • Q' 

APPENDIX 1. COMPLETENESS OF THE: 
d2 METRIC 

In the' paraller associative addressing of data, 


1 ... 1 

AA'*’ - 2^-2 . - 2^-2 I 

1 ... 1 ^ 

This formula may now be re-introduced into 
(9) to give the following very simple form 

1 ... 1 

2N-2X' • • XT + 2^-2 X'XT = S'ST 

^ 1 . . . 1 J ( 10 ) 
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If one notes that the effect of the matrix opera- 
tion in the left hand member is merely to tally 
twice the nonzero elements of X' and and 
that this number is n, then (10) can be reduced 
and the resulting equation solved for X'X'*' 

X'XT = { S'ST — n22^-2 ) (11) 

Finally the distance between X and X' is found 
to be 

d" == n ( S'S'*' — n"2^-2 } (12) 

where the right hand term is expressed only in 
terms of S, S' and parameters descriptive of 
the general characteristics (n and N) of 
S-space as was desired. This proves that for 
some R, in this case R = the metric de- 
fined on S is a complete one, that is to say, that 
the location of a point in S-space is unambigu- 
ously specified by the S vector. 

First consider the form of the S matrix, 
S — X • A. It will have integer elements with 
all integer values 0 thru n being represented. 
This can be decomposed into the following use- 
ful form 

n 

S = ^ S(i) (13) 

i = 1 

where each of the S(i) is the matrix whose 
elements are either i or 0. This representa- 
tion of S allows one to represent a analytically 
in the following form. 

n 

a=^YS(i) (14) 

i = S 

where 8 is the threshold value chosen for the 
logical operation in (6). 

The basic problem is: given a and a', is it 
possible to compute the distance between X 
and X', or in other words is the weak metric a 
complete in the extended space? Note that the 
value of the threshold 8 against which a is com- 
puted is left unspecified, and hence can be con- 
sidered as one of the parameters of the prob- 
lem. Assume 8 to be n ; in other words the asso- 
ciation vector must have a 1 in precisely every 
code position in which the X vector has a 1 if 
the corresponding a element is to be nonzero. 
As we have noted before, this result is not 


affected by the presence of extra I’s in the 
association vector, a is now a single term and 
may be written as 

a=^S(n) (14-a) 

It is now possible to compute the number of 
nonzero elements in a rather simply. If there 
are n partitions with k;. discrete values possible 
in the i-th partition, and if the association 
matrix is the same complete binary matrix dis- 
cussed before, the number of elements whose 
value is n in S is 


n 

<f>o(n) = 2*'’“^ 

i = 1 


(15) 


If we consider another vector with (n — 1) of 
the partitions identical, with a single element 
different in a partition possessing kj elements, 
then the number of ai = n in the same relative 
positions of the two a vectors is 

,^i(n) ^ -fT x2^'“^ 

i = 1 


i =7^ j 

This argument is simply extended to the case 
where the two X vectors differ in an arbitrary 
number, p ^ n, of partitions 

n j(p) 

,^„(n) = yj 2 ^'-^ 

i = 1 j = j(l) 


i ¥= j 


(17) 


It is possible to express X as a function of a 
as defined by (14-a), <#)o(n) as given by (15) 
and A, or more precisely A’’’. We shall now 
prove the following theorem. 



First consider the term aA'^. ai is a one, if and 
only if, Si is equal to n, which is to say that the 
binary cycles of the rows of A, assumed to be 
ordered as shown in the previous example for 
this argument (which correspond to a 1 in the 
elements of the X matrix) must all be positive, 
or equal to one in this case. Thus, the operation 
of matrix multiplication followed by the logical 
decision (6) is exactly analogous to a double 
logical extract operation. If we now reverse 
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this operation and extra a against i.e., the 
logical operation 
n 

^ aiAy = J3j (19) 

i = 1 

The rows of A which contributed to making Si 
pass the threshold test will now be “in phase'' 
with the variations of aj.; that is they will be 
positive in all cases when aj, is positive. All 
other rows will assume all possible binary ar- 
rangements while these are fixed and conse- 
quently will be positive and zero equally often. 
Thus the “in phase" rows will contribute a 1 
for each 1 in a, or what is the same thing, if 
R is understood to represent a general “in 
phase" row of A and R'^ is the corresponding 
column of A'^' 

n 

a R'^= ^ a, = <t>o{n) ( 20 ) 

i = 1 

and similarly if r is a general “out of phase" 
row of A, and r'^ is the corresponding column 
of 

n 

ar-" = 1/2 Y cc,= (21) 

i = 1 

But if one recalls the definition by which the 
terms “in phase” and “out of phase” were intro- 
duced, namely, that an in phase row of A is one 
which corresponds in position to nonzero ele- 
ment of X and conversely for the out of phase 
rows, we see that the vector X is derivable 
from aA’’. aA"^ is therefore a (1 x N) row 

vector which has in every position in 

which X has a zero, and <#>o(n) in every posi- 
tion in which X has a one. Equation (18) is 
thus obtained by a trivial algebraic manipula- 
tion upon aA’’'. 

The original question which prompted this 
discussion can now be answered in the affirma- 
tive — for an appropriate choice of the thresh- 
old, S, and for R sufficiently large (at least for 
the case R = 2^' as we have demonstrated) X 
can be expressed as a function of its distance 
from each of the association vectors in terms 
of the weak metric a. Certainly if X and X' 
can be expressed in this manner, the distance 


between them, d^, can also be expressed in terms 
of the same variable since d^ is defined on X 
and X'. This is incidentally a much stronger 
result than the one which we set out to prove 
which only required that we be able to find d-, 
not that we be able to solve uniquely for X and 
X'. The weaker case is interestingly enough 
much harder to prove ; however, it is also much 
more valuable in the application of the con- 
struction to modeling real associative memories. 


Having completed all of the preliminary for- 
mulations required to compute d^ for any vector 
pair, X and X', in S-space it is an interesting 
exercise to compute the formula for d'‘‘ itself. 


d2 


= n — 


2a^A^ 

^o(n) 


( 1 .. 


r 2aA'^ 
l<^o(n) 



This may be expanded into the following form 
which is more amenable to simplification and 
computation. 


d2 = n — N 


-#>0 (n) 


4 2 

a'A’TAa'^ — 


<f>fl(n) 




The two matrix expressions 


1 J 


( 22 ) 


1 1 


(1 . . . 1) Aa'i' and a'A'*' 


I 1 J 


may be reduced to their equivalent scalars. 
First by the argument of the previous section 
a' A’’’ is a (1 x N) row matrix which has the 

element in every position in which X 

had a zero and the element ^.(n) in every 
position in which X had a one. A similar state- 
ment applies to the column matrix Aa’'. The 
final matrix operations 

r 1 1 


(l...l)XandX 


I 1 
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simply sum these elements so that one may sum- 
marize these results by 


<^o(n) 


(l...l)AaT = 


r 1 1 


<#>o(n) 


aA'f 


(N + n) 
(23) 


which makes it possible to express (22) in the 
following simple form 

= (N + 3n) — -4;^ a'A'^Aa-^ (24) 

which may itself be simplified even further by 
expanding A''’A into a summation of simple 
matrices. 


The results expressed by formulas (12) and 
(24) show that d^ is computable from the 
images of X and X' into S and S' or a and a' 
respectively and that therefore the metric d^ 
is complete in all three spaces as was to be 
shown. 


APPENDIX 2. CONVERGENCE PROOF 
FOR THE ALGORITHM 

The iterative procedure, which we have 
called nonselective relaxation, upon which the 
memory fill is dependent, must be proven to con- 
verge. It is much easier to approach such a 
proof by reducing the procedure to its geo- 
metrical equivalent. 

Let the estimate of Q stored in the memory 
at the time a data word {f (x) ; X) appears be 
Q', then the iterative sequence is : 

R = f(X) — a • Q' 


I i 

Q" = Q' Aa'^ 

It is convenient to use the duals of a and Q, i.e., 
instead of the a being points in M space and Q 
a hyperplane through the origin, we consider 
the a to be hyperplanes whose normal from the 
origin was the point a befoEe^. and Q becomes 
a point in S space. Using ISis convention the 
sequence (8) has a very simple geometrical 
interpretation. R is the normal distance of the 
point Q from the hyperplane defined by a. Q" 
then becomes the normal orthogonal projection 
of Q' into the hyperplane, as Figure 15 illus- 



trates, jr, is the hyperplane defined by ani. In 
the above example it is obvious that Q will con- 
verge to the solution point Q; however, a gen- 
eral proof is required for higher dimensional 
spaces, and for over determined systems. 

Let jri (i = 1, 2, . . . , R) be a family of 
(n — 1) dimensional hyperplanes defined by 
the aii) in E„, and let the indices designate some 
arbitrary ordering. We will represent by the 
symbol Ti.t+i the projection of in into in + i, per- 
pendicular to TTi+i, from the point at infinity, i.e., 
a parallel orthogonal projection of n-i onto in+j. 
We shall define the projective transformation 
P to be the product of the R projections T, 

P = T„ T,3 . . . T,u 

« 

which is permissible since the product of arbi- 
trarily many projections is itself a projertivity. 
From the definition of P it follows that the 
image of any point in ti under P is also in irj. 

Theorem: Let Q be an arbitrary point of vi, 
then the limit of P"Q(, exists, and furthermore 
is a fixed point of P, i.e., 

Lim P"Q„ =r Q Q„, Qt^, 

and 

PQ = Q 

Preliminary Remarks : 

1. There is at least a single fixed point for 
every projectivity,^^-^’^ and hence for P in 
particular. 

2. The transformations Ti,i + i as defined 
above have the following property, where 

||TQ;— TQ.i|j =:k,j|Q{ — Q,Vi (25) 

where 

Q 1, Qo «fi, TQJ, TQ'i €171+ 1 
and T has been written for T,. , + 1 since no con- 
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fusion can result. ki is a non-negative scalar 
bounded by the following inequality 


and 


cos @i ^ ki ^ 1 


cos ®i = 1 


tti • Oti+i 




“1+1 


is the unit vector along the normal from 

«i 

the origin to the hyperplane ^n. 


Now choose an arbitrary Qo citj, and let Qi be 
the image of Qo under P 

PQo = Qi 

Case 1. If Qo « S then ^ 

PQii = Qo 

and Qo is a fixed point of the transformation 
and the theorem is trivially true. 


ki cannot be specified more tightly without 
knowing the orthogonal projection of the line 
segment [Qi, Qo] on the intersection of and 

ITl + l. 


Proof : Since Ti,i+i is a linear transformation 
we may write 
||Tj, j + iQi — Tj, j+iQfl|| “ 

I |Tj, ,Q^-i - T, jQi-’ 1 1 (26) 

etc. for j steps in all ; where 

Qii Qo ^■j etc. 

However, stopping with the step shown in (26) 
and using (26) we may write 

IITQi-^TQilll =k,||Ql~Qi||=r 

kjiiTQi-i-TQJ-i|| 


where the T in the left hand norm is Tj, j+i and 
in the right hand norm is Tj-i, j. Now by re- 
peatedly applying this reduction, j times in 
all, we obtain 

j 

||T,.,+,Q{ — Tj.j+,Qi||= Xf kiilQi — Qo|| 

i = 1 (27) 


where 


Qi» Qo 


sa-i. 


The R hyperplanes are considered in a cycli- 
cal order ; therefore on the R -f- 1-st projection 
the images will be in wi, and from (26) and 
(27) we have: 


||PQi — PQoll = K||Qx — Qoll (28) 


where 


R 


K= IT k, 

i = 1 


P is a projectivity of »-i onto itself, and as such 
(as noted in Remark 1) has at least one fixed 
point, or in general some fixed subspace S such 
that for every 

(T c S 


P <r cr 


Case 2. Let Qo e irj ; Q« e S 
then 

Qo ¥= Qi 

and at least one ki ^ 1. kj = 1 is equivalent to 
saying that the norm of the segment [Qi, Q«] 
is the same as the norm of the orthogonal pro- 
jection of [Qi, Qn] on the intersection of irj 
and JTj+i, say pj. But then for all p, to be 1 
would require that the pj be pairwise parallel, 
i.e., Pj parallel to pj+i, etc., and Qo = Qi neces- 
sarily, in contradiction to the assumption that 
Qo 7 ^ Qi. Therefore at least one ki ^ 1 and 
R 

K = IT k, < 1 
i =*1 

Equation (28) may be expressed as 

IjPQj — PQj-i|NK||Qj — Qj-i|| (29) 

where we need not show superscripts on the Qj 
since all points and their images under P are 
in TTi alike. Equation (29) may be recursively 
solved to yield 

||PQj-PQj-i|| = l|Qj..i-Qj-||^ 

Ki||Qi — Q«|| (30) 

where 

IIQi — Qoll > 0. 

Now we wish to determine the Cauchy differ- 
ence for the sequence {Qj} ; |Qj+m — Qj|i- Using 
(30) repeatedly we obtain 

!lQj+„_Qj||^ (Ki+“-i -h KJ+--2 

+ . . . +kJ||Qi — Qoll 

after the usual introduction of the missing 
terms and appropriate regrouping. 

I iQi+o. — Q jI 1^ Ki ( \~ -x) I I 

but for a suitable choice of j the right hand side 
of (31) can be made less than any e > 0 ; there- 
fore the sequence {Qj} converges to a point Q; 
Qer,. 
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We must still prove that Q is indeed a fixed 
point of P, i.e., Q € S. 

||PQ — Q|| ^ ||PQ — PQjII + ||PQj — Q|| 
^K||Q — QjII + IIQj+i — Q|| 

but both of the right hand terms have been 
shown to go to zero as j increases; therefore 
I |PQ — Q| I < e for every € > 0 and Q is shown 
to be a fixed point of P, or 

PQ = Q :> Q € S 

Corollary: If the system of hyperplanes de- 
fines a determinate system of linear equations 
with a unique solution point, then this point is 
the only fixed point under any of the possible 
projective cycles P, i.e., any reordering of the 
indices, and the above proof shows that 

Lim P" Q' = Q 

n 00 

where Q' is any point in any plane and Q is the 
unique solution point of the system. 

The foregoing argument completes the con- 
vergence proof for the algorithm proposed for 
filling the associative memory. 
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DESIGN OF AN EXPERIMENTAL MULTIPLE 
INSTANTANEOUS RESPONSE FILE* 


E, L, Younker, C, H. Heckler, Jr,, D, P, Masher, and /. M. Yarborough 
Stanford Research Institute 
Menlo Park, California 


SUMMARY 

An experimental model of an electronic ref- 
erence retrieval file in which all file entries are 
interrogated simultaneously has been designed 
and constructed. The experimental model is 
designed to store and search on a file of indexes 
to 5,000 documents. A document index consists 
of a decimal accession number and up to eight 
English word descriptors that are closely re- 
lated to the contents of the document. The 
vocabulary required to describe the documents 
is held in a machine dictionary that has a 
design capacity of 3,000 words. In the model 
delivered to the sponsor, Rome Air Develop- 
ment Center, the storage capacity is only par- 
tially used. The specification for the delivered 
model calls for the storage of approximately 
1,100 documents that were selected from the 
ASTI A (now DDC) Technical Abstract Bul- 
letin and of the vocabulary needed to describe 
them (about 1,000 \^ords). The document in- 
dexes and the dictionary words are stored in 
wiring patterns associated with arrays of 
linear ferrite magnetic cores. 

A search question, consisting of one to eight 
descriptors in their natural English form, is 
entered by means of an electric typewriter. 
During entry of the search question, the dic- 
tionary magnetic store is interrogated by the 


alphabetic code of each search word. If a word 
is not contained in the dictionary, it is auto- 
matically rejected. After all words of the 
search question have been entered, the docu- 
ment magnetic store is interrogated by the 
search question in superimposed code form. 
The comparison between the search word and 
the document indexes is made for all documents 
simultaneously and the machine instantaneously 
determines if any documents in the file in- 
clude the search question. If there are none, 
the machine indicates visually that there is no 
response. If there is at least one, the machine 
counts the number of responding documents 
and displays this number. Then it types out 
the indexes of all responding documents on the 
same typewriter that was used to ask the 
question. 

INTRODUCTION 

Memories that can be searched in parallel 
and from which stored information is retrieved 
on the basis of content have received consider- 
able attention for application to retrieval file 
problems.^* 2, 3, 4 paper describes the de- 

sign of an experimental retrieval file based on 
the work reported by Goldberg and Green.^ 
Since the contents of the semipermanent mag- 
netic memory used in the experimental file can 
be searched in parallel and multiple responses 


* The work described in this paper was supported by Rome Air Development Center under Contract AF 30(602)- 
2772. 
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to the search question are permitted, the system 
is called MIRF — Multiple Instantaneous Re- 
sponse File.5 

LOGICAL ORGANIZATION OF THE MIRF 
SYSTEM 

The logical organization of the experimental 
MIRF system is illustrated by Fig. 1. Infor- 
mation pertaining to the document indexes and 
to the descriptors used in the document indexes 
is contained in two major units called MIRF 
units. A MIRF unit is basically a magnetic 
memory in which information is permanently 
stored in the wiring associated with the mag- 
netic cores. The Document MIRF is the 
principal element of the system. It contains 
fdr each stored document index the document 
accession number and the descriptors (in 
coded form) that describe that document, as 
well as a superimposed search code that is used 
in the searching process. The Dictionary MIRF 
has two functions. During the input phase of 
operation it translates the alphabetic code of 
the English word descriptor that is entered 
from the typewriter into the binary serial 
number assigned to that English word for use 
inside the machine. During the output phase, 
the Dictionary MIRF translates the binary 
serial number of a word that is obtained dur- 
ing a search into the alphabetically coded form 
of that word. 

After the binary serial number of an input 
English word has been generated, this binary 


number is translated by a logical process in the 
Search Code Generator into a search code that 
is assigned to the particular English word. 
The search codes of successive words of a search 
question are superimposed by adding them to- 
gether, bit by bit by an inclusive-OR operation. 
When the search question is complete, the 
superimposed search code of the question is 
compared with the superimposed code section 
of the Document MIRF. Each document index 
whose search field includes the superimposed 
code of the search question is said to respond 
to the question. Frequently more than one docu- 
ment will respond. By a logical process for 
resolving multiple responses,® the accession 
number of a particular responding document is 
generated. Then the binary serial numbers of 
the English words contained in this document 
index are generated one at a time. By means 
of the Descriptor Selector, each serial number 
is transmitted to the Dictionary MIRF, whe're 
it is translated to the alphabetic code of the 
English word. This process is repeated for 
each responding document. 

SYSTEM DESIGN 

1. Magnetic Implementation of the MIRF 
Unit 

The MIRF units of the experimental model 
use an interesting modification of the Dimond 
Ring'^ translator in which the drive and sensing 
functions are interchanged. Information is 
stored in unique wiring patterns associated 
with an array of linear ferrite cores as il- 
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lustrated by Fig. 2. Each item of stored in- 
formation (a document index in the Docu- 
ment MIRF or a descriptor in the Dictionary 
MIRF) is represented by a conductor that 
passes through or around each associated core 
in a unique pattern determined by the informa- 
tion it contains. In series with each conductor 
is a diode. The cathodes of many diodes are 
connected together to form the input to a de- 
tector amplifier. Notice that one core is re- 
quired for each bit of information, but that 
each core can be associated with a particular 
bit of many item conductors. 

Each core has an input winding that can be 
selected by means of a switch. All cores whose 
selector switch is closed will be energized when 
a drive pulse is applied. A voltage will be in- 
duced in each item conductor that threads an 
energized core, but no voltage will be induced 
in conductors that do not thread the core. A 
test can be made on the information stored in 
many cores by selecting a particular set of 
cores and energizing them. In order for an 
item to match the test information, its conduc- 
tor must pass outside of every energized core. 
Then no voltage will be generated in the item 
wire and the input to the detector amplifier 
will be held near ground through the item 
diode. Voltages will be induced in the conduc- 
tors of items that do not match the test; the 
polarity of these voltages is chosen to back-bias 
the associated diodes. If no item matches the 
test information, a voltage will be induced in 
every item conductor and every diode will be 
back-biased. The input to the detector will 
then assume a significantly negative voltage. 
Thus, the presence or absence of desired stored 
information can be determined by applying the 
drive currents to a particular set of cores. 
This is a function of an associative or content- 
addressed memory: to indicate the presence or 
absence of certain information based on the 
detailed contents of a search question without 
regard to the actual location (or address) of 
that information. 

Now consider in more detail how a bit of in- 
formation of a search question is compared 
with information in a MIRF unit. Figure 3 
illustrates how a test is made to determine 
whether or not the test bit is logically “in- 
cluded’’ in the stored information. This cir- 



Figure 3. Circuit for Testing Inclusion. 


cuit is typical of those used in the superimposed 
section of the Document MIRF. One core is 
used to store the fcth bit of many items. The 
fcth bit of the search question is stored in a 
flip-flop whose one side is connected by way 
of an AND gate to a drive amplifier, which in 
turn is connected to the primary winding of 
the feth core. The conductor of an item whose 
feth bit is equal to one (Conductor 1) passes 
outside the kth core. On the other hand, the 
conductor of an item whose A:th bit is equal to 
zero (Conductor 2) threads the core. If the 
flip-flop stores a one, the primary winding of 
the core will be energized when the timing 
pulse is applied to the AND gate. A voltage 
will be induced in Conductor 2 (indicating a 
mismatch) but none will be induced in Conduc- 
tor 1 (indicating a match). If the flip-flop 
stores a zero, the primary winding will not be 
energized because the timing pulse will be 
blocked at the AND gate. No voltage will be 
induced in either conductor, and a match will 
be indicated on both lines. Therefore, it can 
be seen that a stored one bit includes both a test 
one and a test zero, while a stored zero bit in- 
cludes only a test zero. 

The circuit for testing for identity between 
the test bit and the information stored in the 
MIRF is shown in Fig. 4. This circuit is typical 
of those used in the alphabetic descriptor por- 
tion of the Dictionary MIRF. The ;th bit of 
many items is stored in a pair of cores Ja and 
Jb. The jih bit of the test question is stored in 
a flip-flop. In this case, both the one and zero 
sides of the flip-flop are connected to AND gates 
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Figure 4. Circuit for Testing Identity. 


whose outputs control drive amplifiers that are 
connected to the primary windings of the cores 
Ja and j'b. The conductor of an item whose jth 
bit is one (Conductor 1) bypasses core Ja while 
the conductor of an item whose jth bit is a 
zero threads core j’a. The threading of core in 
by the two conductors is the reverse of the wir- 
ing of core j’a. If the flip-flop stores a one, the 
primary winding of core Ja will be energized 
when the timing pulse occurs. No voltage will 
be induced in Conductor 1 (a match indication) 
but a voltage will be induced in Conductor 2 (a 
mismatch indication). If the flip-flop stores 
a zero, the primary winding of core Ju will be 
energized. In this case, a voltage will be in- 
duced in Conductor 1 but not in Conductor 2. 
Thus it can be seen that the bit stored in the 
MIRF must match the test bit identically for a 
match indication to be obtained. 

2. Basic Operations Using the MIRF Units 

Two types of operations involving the MIRF 
units are basic to the operation of this experi- 
mental model. One operation tests to see if 
certain information is contained in the MIRF. 
The other uses information that is contained 
in the MIRF to generate a number in a flip- 
flop register external to the MIRF unit. Ex- 
amples of these basic operations are given in 
the following paragraphs. 

a. Testing of Information Contained in the 
MIRF’ Unit 

Dictionary MIRF — During the input of the 
English words to form a search question, the 


Dictionary MIRF is tested to see if the input 
word is contained in the vocabulary (that is, if 
it is a valid descriptor). This is done by gating 
the alphabetic descriptor register to the drive 
amplifiers associated with the alphabetic por- 
tion of the MIRF (50 bits long, two cores per 
bit). As a result, 50 drive amplifiers are ener- 
gized and 50 primary windings in the MIRF 
carry current. If one of the stored words has 
a bit pattern in the alphabetic portion that 
matches identically the energized set of pri- 
maries, the match detector will indicate a match 
condition. If not, the match detector will in- 
dicate a mismatch condition. The output of the 
match detector is used to determine the next 
step in the logical sequence. It is important to 
note that the test is applied to the entire Dic- 
tionary MIRF simultaneously and that a match 
or mismatch signal for the entire MIRF is 
obtained in about 5 microseconds. 

Document MIRF — After all words of the 
search question have been typed, the superposi^ 
tion of their search codes is held in the search 
code accumulator. At the beginning of the 
actual search operation, the flip-flops of the 
search code accumulator are gated to their as- 
sociated drive amplifiers. A particular set of 
drive amplifiers is energized and current flows 
in a corresponding set of primary windings in 
the 80 bit superimposed code field of the Docu- 
ment MIRF. If the detailed bit pattern rep- 
resented by the energized primaries is in- 
cluded in any of the superimposed fields of 
the stored document indexes, a match condition 
is indicated by the match detector. If not, a 
mismatch indication is given. The test is made 
on the entire contents of the document MIRF 
simultaneously and a YES/NO response is 
obtained in about 5 microseconds. 

It should be pointed out that the criterion for 
a match is inclusion, not identity. A document 
index includes the search question if the fol- 
lowing conditions of the superimposed' search 
code portion of the index are satisfied. First, 
for every bit of the index search field that is a 
one, the corresponding bit of the search ques- 
tion is either a zero or one. Second, for every 
bit of the index search field that is a zero the 
corresponding bit of the search question is a 
zero (in other words a binary one includes both 
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a one and a zero, but a binary zero does not in- 
clude a binary one). 

b. Generating Numbers by the MIRF Proc- 
ess — The generation of the serial number of an 
input descriptor illustrates this operation. As- 
sume that an English word has been typed in 
and that the test for valid descriptor is true. 
Because a match is obtained when the alphabetic 
descriptor register is gated to the Dictionary 
MIRF, one item wire in the MIRF is effectively 
isolated: namely, the wire that is uniquely re- 
lated to the input descriptor. The detailed wir- 
ing pattern of this wire in a group of cores out- 
side the alphabetic code field contains the binary 
serial number of the input descriptor. By gat- 
ing the alphabetic descriptor register to the 
MIRF and at the same time causing current 
to flow in the primary winding of a core that 
is in the serial number portion of the MIRF, 
the binary value associated with that core for 
the selected line can be determined. The pres- 
ence of current in the additional winding tests 
for a binary one in that position. If the match 
detector indicates a match, the value is indeed 
one. However, if a mismatch is obtained, the 
value must be zero. 

The sequence for generating the serial num- 
ber is as follows: First the flip-flop register 
that will eventually hold the serial number is 
cleared to all ones. Then the alphabetic descrip- 
tor register is gated to its drive amplifiers and a 
drive amplifier associated with the parity bit of 
the serial number is energized. The output of 
the match detector is observed. If a match 
condition is observed, it is known that the 
parity bit is actually a one and the parity bit 
flip-flop in the serial number register is not 
changed. If a mismatch is observed, it is known 
that the parity bit is zero and the parity bit 
flip-flop in the serial number register is not 
to zero. The next step is to energize the drivers 
associated with the alphabetic descriptor reg- 
ister and a driver associated with the least 
significant bit of the serial number. Again the 
output of the match detector is observed and 
the flip-flop assigned to the least signiflcant bit 
is either allowed to stay at one or is changed to 
a zero. This procedure continues for thirteen 
steps. At the end of this time, the 12-bit serial 
number and its parity bit will have been gen- 
erated and stored in the serial number register. 


CIRCUIT DESIGN 

Three principal types of transistor circuits 
are used in the experimental model : transistors 
are used as switches to drive the primary wind- 
ings of the MIRF cores ; discriminator-amplifier 
circuits are used to accept the voltage generated 
on the secondary windings of the MIRF cores 
(this is the match detector circuit) ; and transis- 
tor logic circuits are used for the over-all con- 
trol of the MIRF operations. All three types 
were designed at SRI. 

1. MIRF Driver 

The drive currents that are required by the 
ferrite cores in the Document and Dictionary 
MIRFs are furnished by circuits such as the 
one shown schematically in Fig. 5. Four MIRF 
driver circuits are mounted on one printed cir- 
cuit plug-in board, as shown in Fig. 6. Each 
circuit is capable of supplying the required 2 
amperes at low impedance. The power transis- 
tor that delivers the drive current (Type 
2N1905) is driven by a push-pull emitter fol- 
lower that provides 60 milliamperes of base 
drive current into 2N1905. The output power 
transistor has rise-and-fall time capabilities of 
less than 0.3 microsecond. The actual current 
in the load is nearly linear because of the in- 
ductive nature of the load and builds up to the 
2 ampere amplitude at the end of approximately 
10 microseconds. The overshoot voltage in- 
duced when the transistor is turned off is 
clamped by a silicon diode to — 36 volts. The 
clamp prevents excessive voltage spikes from 
appearing across the output transistor while 
still allowing the load inductance to recover 
within 10 microseconds. 

Two protective features of the MIRF driver 
circuit should be noted. One is a fuse, which is 
inserted in series with the load to protect 
against excessive load currents. Before the 
winding of the magnetic circuits internal to 
the MIRF assembly can be damaged by too 
much current from, say, an accidental short 
circuit, the fuse wire will open up. The second 
protective circuit includes a square-loop 
memory core that is threaded by the lead going 
to the transistor load. This core is normally 
biased off, but if the drive current exceeds a 
safe value the square-loop core will switch and 
induce a voltage in a sense lead. The voltage in 



520 


PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 



Figure 5. Schematic of MIRF Driver. 



Figure 6. Component Assembly of MIRF Driver Board. 

the sense lead is amplified and used to turn off 
the system clock. The purpose of this circuit is 
to protect the 2N1905 transistor against exces- 
sive heat dissipation from currents that are ex- 
cessive but not large enough to burn out the fuse 
wire. 

2. MIRF Discriminating Amplifier 

The electrical output of the MIRF magnetic 
modules is generated by a very large diode gate 
including almost 300 diodes. Under the worst 
conditions a match signal from this array can 
reach a level as high as 0.4 volt. On the other 
hand, a mismatch signal from the same array 
may only generate a potential of 0.6 volt. It is 


necessary for the MIRF discriminating ampli- 
fier to differentiate between these two signals 
and generate a standard logic level output of 
— 6 volts for a mismatch and 0 volts for a 
match. The circuit for the amplifier is shown 
in Fig. 7. In order to distinguish between very 
closely spaced match and mismatch signals, two 
thresholds are employed in the amplifier. The 
first threshold is provided by a /1N3605 silicon 
diode at the input to the amplifier. This diode 
does not pass signals unless they exceed ap- 
proximately 0.5 volt. After passing the first 
threshold, the signal is amplified in a feedback 
amplifier with a gain of about 50. If the ampli- 
fied signal then exceeds the second threshold 
of 3 volts, a mismatch signal is delivered at 
the output of the amplifier. 

3. Logic Circuits 

In the flip-flop register and over-all control 
circuits, resistor-transistor logic is used. Highly 
reliable circuits that operate in the 100-kc 
frequency range have been developed. The 
basic gate circuit is shown in Fig. 8. This cir- 
cuit in typical use performs a simple majority 
operation. If one or more of its three inputs 
are at a negative potential, the output is held 
at ground potential. Since ground is defined as 
the one state in this system, and a — 6 volt 
potential is defined as a zero state, the basic 
gate performs the “not and” or NAND opera- 
tion. 

All the passive components shown in Fig. 8, 
plus one resistor and two capacitors, are con- 
tained in one physical element supplied by 
Centralab, Inc. These components are screened 
on a passive substrate to a tolerance of 3% 
for the resistors (5% design tolerance) and 
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Fig:ure 9. Component Assembly of Gate-Logic Board. 

10% for the capacitors. The substrates are 
encapsulated with a Durez coating, and are 
ready for mounting to a printed circuit card 
via their projecting leads. 

The gate circuit is a basic part of every logic 
circuit employed in the machine. By itself it 
performs the combinatorial function of logical 
conditions. Two gate circuits properly inter- 
connected form a bistable, or flip-flop, circuit. 
Two gate circuits interconnected in a slightly 
different way form a monostable, or one-shot. 


circuit. The gate circuit is also used as a pre- 
amplifier for an emitter-follower circuit. The 
basic logic circuits, e.g., gates, one shots, flip- 
flops, etc., are mounted on plug-in logic boards. 
A typical logic board, with seven gate circuits 
mounted on a printed circuit board, is shown 
in Fig. 9. 

MAGNETIC DESIGN 

1. General Considerations 

The magnetic design of a MIRF unit is 
centered in the individual magnetic core, which 
acts as a transformer with a multiturn primary 
winding and many single-turn secondary wind- 
ings. When current flows in the primary wind- 
ing, the magnetic core must be capable of 
producing a flux change of sufficient time dura- 
tion and amplitude to generate the desired sig- 
nal in secondary windings. The amplitude of 
the induced voltage is determined primarily by 
the characteristics of the diode associated with 
the secondary winding. The duration of the 
induced voltage is determined primarily by 
noise on the secondary winding and the con- 
sequent delay required before sampling of the 
output can be accomplished. 

The cross-sectional area of the magnetic core 
is proportional, to the product of the amplitude 
and duration of the voltage induced in the 
secondary windings (this is usually referred to 
as the volt-second area of the induced voltage 
pulse). This was kept reasonably small by us- 
ing a high-quality germanium diode (the 
1N500) which requires a back-biasing voltage 
of only 0.6 volt in order to perform properly in 
the diode circuit associated with the input to 
the discriminating amplifier. The circumfer- 
ential length of the magnetic core is determined 
primarily by the number of secondary windings 
associated with the core and the mechanical de- 
sign of the supports for these windings. In 
the MIRF units of the experimental equipment, 
the core has the capacity for 2,000 secondary 
windings. The core’s mean circumferential 
length is 7 inches ; its cross section is a square, 
Yi inch on a side. 

Two other considerations influenced the selec- 
tion of the magnetic cores used in the MIRF 
units. One is the requirement that the core be 
made in two pieces so that the array of cores 
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can be separated into two portions to facilitate 
initial wiring and changes in wiring. The other 
is the necessity of using commercially avail- 
able parts. The number of cores needed in 
this experimental equipment is too small to 
justify the design and production of a core 
of special size or shape. 

2. Details of the Dictionai^y and Document 
MIRF Units 

The individual cores used are the same for 
both the Dictionary and Document MIRF. Each 
core is composed of two U-shaped ferrite struc- 
tures (Allen Bradley part no. UC 892-141C), 
which have been specially modified at the fac- 
tory to permit a maximum of 0.0005 inch air 
gap in each leg when two such structures are 
joined together to produce a MIRF core. To 
drive each core, a twenty-turn primary winding 
is provided. This consists of two ten-turn 
windings distributed in such a manner as to 
minimize the leakage flux and the resulting 
noise signal (see Fig. 10). The primary wind- 
ing drives the core from an 18-volt voltage 
source through a transistor switch driver. The 
output voltage induced upon each secondary 
winding is an essentially rectangular voltage 
pulse having a droop of 0.1 volt in 10 micro- 
seconds, from 0,8 volt at the leading edge to 
0.7 volt just prior to the trailing edge. The 
maximum primary current, 0.7 ampere, occurs 
at 10 microseconds after the beginning of the 
pulse. To accommodate the expanded capacity 
of the MIRF document file (5,000 documents) 
three primary windings will be driven in 



Figure 10. Details of Primary Windings. 


parallel, so that a maximum driver current of 
2.1 amperes is required. 

The performance requirement of the mag- 
netic circuits is that consistent and easily 
separable match and mismatch signals be gen- 
erated at the diode end of the item wires (see 
Fig. 2) when a set of primary windings is 
driven. The design objective was that a maxi- 
mum match signal of 0.1 volt and a minimum 
mismatch signal of 0.6 volt should be realized 
within 1.5 microseconds after the application 
of the primary drive pulses, and that pulsing 
of the MIRF cores be repeated for many cycles 
at a 50-kc clock rate. To achieve these goals, 
noise due to ringing and leakage flux had to 
be minimized. 

A MIRF unit contains many cores (the Docu- 
ment MIRF has 234 and the Dictionary MIRF 
has 140), each with a separate primary wind- 
ing; further, each core is associated with more 
than a thousand single-turn secondary wind- 
ings. The secondary windings pass through or 
around all cores in the unit and so form a long 
rope. The capacitance between wires in the 
rope, the inductance of these wires, and the 
inductance of the primary windings are inter- 
coupled in a very complex manner. In the de- 
velopment of the MIRF units, substantial noise 
on the secondary (item) windings was experi- 
enced due to ringing currents in the primary 
windings. This noise was reduced to a negligi- 
ble level by inserting a Type DI52 diode in 
series with each primary winding and shunting 
each primary by a 1000 ohm resistor. A low- 
amplitude noise signal of about 5 Me, due to 
inductance and inter-item capacitance of the 
secondary windings, was also observed. Such 
noise could be reduced to a very low level by 
filtering at the input to the discriminating am- 
plifier, but in the experimental system this was 
not necessary. 

Noise due to leakage flux must be kept small 
in order to hold the maximum match signal at 
0.1 volt. A secondary wire that represents a 
match item must pass outside all energized 
cores. Since in the worst case, 57 cores may be 
energized, the maximum permitted noise due 
to leakage flux at each core is less than 2 milli- 
volts (this corresponds to a leakage flux of i/4 
of one per cent at each core). In the experi- 
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mental model two methods are used to reduce 
leakage flux. One is distributing the primary 
winding on the cores to compensate for the mag- 
netic potential drop by a corresponding rise 
in magnetic potential at the points where the 
drop occurs. As Fig. 10 shows, the winding 
has a linear spacing except at the points where 
the air gaps occur ; there two turns are closely 
spaced. The second method uses cancellation 
of induced voltages to reduce the effect of leak- 
age flux. The common end of many item wires, 
instead of being connected to ground, as shown 
in the simplified diagram of Fig. 2, is actually 
connected to a wire that lies in the item wire 
rope and passes outside of all cores. The volt- 
age induced in the “cancellation lead” at any 
core by leakage flux is approximately equal to 
that induced in item wires and is opposite in 
polarity (relative to the input terminals of the 
discriminating amplifier). 

MECHANICAL DESIGN 
1. The MIRF Module 

Implementing the wiring-patterns-on-cores 
method of storage illustrated by Fig. 2 pre- 
sented a challenging mechanical design prob- 
lem. It was necessary that the physical struc- 
ture containing the magnetic cores and the as- 
sociated wiring be made in two parts that could 
be easily separated. It was desirable to fabri- 
cate submodules of wiring patterns, so that the 
permanently stored information could be 
changed mechanically in relatively small blocks. 

Separate MIRF modules are used to store the 
information concerning document indexes and 
dictionary words. In each, the cores are ar- 
ranged in a rectangular pattern and are sup- 
ported by long bobbins. These bobbins are 
firmly attached to a base structure and carry 
the primary windings for the cores. A MIRF 
module is a complete assembly of magnetic 
cores, primary windings for the cores, and sub- 
modules of secondary windings with their asso- 
ciated diodes. The construction of a module is 
illustrated by the exploded view of Fig. 11. The 
principal parts of the assembly are the base, 
or coil bobbin, assembly and the item wiring 
trays. 

The coil bobbin assembly consists of a field 
of paper bobbins (two per magnetic core) that 


are cemented to a V^-inch-thick phenolic board. 
Each bobbin carries a ten-turn winding. The 
windings on pairs of bobbins are connected in 
series to form the primary winding for one 
of the magnetic cores. An item tray is a %6- 
inch thick phenolic board with a field of shallow 
bobbins that matches the field of coil bobbins. 
The bobbins on the item tray are slightly larger 
than the coil bobbins, permitting item trays to 
be stacked up on the coil bobbin assembly. One 
item tray can accommodate 286 item wires. The 
diodes that are connected in series with the 
secondary windings and form the input circuit 
to the discriminating amplifier are mounted on 
the edge of the item tray. A MIRF module is 
assembled by sliding up to seven item trays 
into position on the coil bobbin assembly. One 
set of U cores is then inserted into the set of 
coil bobbins and held in place by a plate with 
a silicone-rubber pad. The other set of U cores 
is then dropped into position on the opposite 
side of the bobbin coils. Finally, the top plate 
(also with a spongy pad) is dropped into posi- 
tion to hold the entire assembly intact. The 
two sets of U cores are held together under 
slight pressure from the silicone pads. 



Fignire 11. Exploded View of MIRF Module. 
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A complete item tray is shown in Fig. 12. The 
item wires start in the upper left corner of the 
trays, where they are connected to a common 
bus bar. They pass from left to right in the 
first row of cores, then back and forth until 
they emerge in the lower left center part of 
the tray. The wires then run to assemblies of 
diodes, where each wire is connected to its own 
individual diode. The output side of the diodes 
(the cathodes) are connected together and 
wired to a small connector, which is seen in 
the lower left hand portion of the tray. Even 
though each tray contains detailed wiring for 
286 items, only two wires run from the tray 
to the external discriminating amplifier. Fig- 
ure 12 also shows a pair of primary coil bobbins 



Figure 12. MIRF Item Tray. 



Figure 13. Close-up of Document MIRF Module 
(Top Plate Removed). 


with the two U cores inserted. A closeup of a 
MIRF module with the top plate removed is 
shown in Fig, 13. The tops of one set of U 
cores can be seen as well as four item trays. 
The connectors for the output of the item trays 
can be seen in the lower center part of the 
photograph. The discriminating amplifier cir- 
cuits (one for each of the seven item trays that 
can be included in a module) are located on 
the circuit board that is mounted in front of 
the magnetic module. 

2. Wiring of the Item Trays 

The item trays in the Document and Diction- 
ary MIRF units store more than one-third of 
a million bits of information. To ensure the 
greatest possible accuracy of the wired-in in- 
formation, two steps were taken. First, the 
raw data for the documents were computer- 
processed to give a set of punched cards that 
contain the detailed wiring information. Sec- 
ond, a wiring scheme was devised, which pre- 
sented the detailed wiring information to a 
wireman in a very simple form, and which in- 
cluded a means of checking the accuracy of the 
wiring as the wiring was actually done. In this 
scheme, the path that a wire was to take was 
delineated by a set of lights in an array of 
incandescent lamps. 

An over-all view of the item-tray wiring 
equipment (wiring aid) is shown in Fig. 14. 
The empty wiring tray is placed on the wiring 
jig in front of the operator. A card is then 



Figure 14. Over-all View of Item Tray 
Wiring Equipment. 
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placed in the punched-card reader and a pattern 
of lights is set up in the wiring jig. Number 
36 Nyleze wire is taken from a spool through 
a tensioning device to the top of a special wir- 
ing tool (shown in the hand of the operator). 
The wire from the bottom of the wiring tool 
is first soldered to the common bus shown in 
the upper left part of the wiring tray. The 
tool is then moved along the path specified by 
the pattern of lights, leaving the wire wound 
in the desired pattern around the item tray 
bobbins. Correct wiring at a bobbin is indicated 
by a light turned on to yellow brilliance. If a 
light is off, or is on at white brilliance after 
the wiring tool passes a bobbin position, a wir- 
ing error is indicated. 

3. Alternative Method of Fabricating Item 
Trays 

Alternative methods of preparing wired-in 
information that may be more easily automated 
than stringing of small wire have been investi- 
gated. One alternative is illustrated by Fig. 
15, which shows an item conductor in the form 
of a metallic path etched on a thin, copper- 
coated Mylar sheet (half -ounce copper on 2-mil 
Mylar) . It will be noted that the item conductor 
is connected to a bus at the top of the sheet 
and to another bus at the bottom. These copper 
areas are used for connecting the item con- 
ductor to the common bus at one and to a diode 
at the other. This sheet contains one item, but 
two item conductors could easily be placed on 



Figure 15. MIRF Item Conductor Formed by 
Metallic Path on Mylar Sheet. 


one sheet, one being associated with one leg 
of the magnetic core and the other with the 
other leg. The experimental model contains 
a submodule of 75 items on Mylar sheets. 

DELIVERED EXPERIMENTAL EQUIP- 
MENT 

The experimental Multiple Instantaneous Re- 
sponse File System is an all-solid state equip- 
ment. Transistor drive circuits capable of 
supplying two amperes of current to magnetic 
circuits, special discriminating amplifiers capa- 
ble of operating reliably with a poor signal-to- 
noise ratio input signal, and transistor logic 
circuits were designed for high reliability, low 
cost, and moderate speed. About 300 current 
drive transistors, 2500 logic transistors, 2500 
printed gate circuits (a group of 6 resistors, 2 
capacitors and their interconnecting wiring on 
a passive substrate) and 5,000 diodes are used 
in the system. Except for sequences involving 
the input-output typewriter, the system oper- 
ates synchronously under the control of clock 
pulses derived from a 50-kc transistor multivi- 
brator. 



Figure 16. Front View of Experimental 
MIRF Equipment. 
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Fig-ure 17. Rear View of Experimental MIRF 
Equipment (Doors Removed). 


The experimental equipment shown in Figs. 
16 through 18 was delivered to Rome Air De- 
velopment Center in July, 1963. A front view 
of the equipment is shown in Fig. 16. The main 
equipment cabinet, the input-output typewriter, 
and the display and control unit can be seen. 
Figure 17 shows a rear view of the equipment 
cabinet with the doors removed. The right hand 
portion of the cabinet contains logic circuits 
for control of the system, arranged in modules 
of plug-in transistor logic boards. The Diction- 
ary MIRF unit is contained in the center por- 
tion of the cabinet. Directly beneath the MIRF 
unit are two modules of drive circuits which 
provide current to the MIRF. In the left hand 
portion of the cabinet are the Document MIRF 
and the transistor circuits for providing drive 
currents to it. It will be observed that space 
has been allowed for one additional MIRF unit 
in the center section and for two additional 
MIRF units in the left hand section. This is to 
provide for the expansion of the Dictionary 
MIRF to 3,000 words and expansion of the 
Document MIRF to 5,000 document indexes. A 



Figure 18. Front View of Equipment with Document 
MIRF Module in Extended Position. 


front view of the cabinets that house the MIRF 
units and their drivers is shown in Fig. 18. 
Here the Document MIRF unit has been pulled 
out to show it in its extended position. Below 
the MIRF units the wiring side of the tran- 
sistor drive modules can be seen. 

The format of the typewritten record of a 
search in the experimental model is shown in 
Fig. 19. The first two lines, “Stanford Research 
Institute Project 4110,” etc., are a manually 
typed heading for the subsequent search. The 
heading was typed while the typewriter was 
effectively disconnected from the rest of the 
equipment. The search question consists of 
three words: “coding,” “computers,” “digital.” 
This line was also typed manually. The rest 
of the printout is the machine’s response to the 
search question. Seven documents responded. 
For each one, a four-digit accession number 
and the English words that describe the docu- 
ment are printed on a single line. The asterisk 
prefix on some words have been copied from 
the ASTIA abstract. It will be observed that 
the three search words appear in every respond- 

STANFORD RESEARCH INSTITUTE PROJECT 4110 
MULTIPLE INSTANTANEOUS RESPONSE FILE 
CODING, COMPUTERS, DIGITAL. 

0156 *CODING, DIGITAL COMPUTERS, DATA PROCESSING SYSTEMS, LANGUAGE, 

0201 RADAR PULSES, RADAR SIGNALS, *COD!NG, DIGITAL COMPUTERS, 

0420 DESIGN, DIGITAL COMPUTERS, *LANGUAGE, CODING, ANALYSIS, 

0540 DIGITAL COMPUTERS, ERRORS, LANGUAGE, CODING, MATRIX ALGEBRA, 

0727 *LANGUAGE, *CODING, *HANDBOOKS, DATA PROCESSING SYSTEMS. DIGITAL COMPUTERS, 
0732 DIGITAL COMPUTERS, CODING, TELETYPE SYSTEMS, DISPLAY SYSTEMS, MAPS, 

0824 DATA PROCESSING SYSTEMS, DIGITAL COMPUTERS, OPERATIONS RESEARCH, CODING, 

Figure 19. Format of Typewritten Record of a Search. 
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ing set of indexes. It should be especially noted 
that the search words appear in different posi- 
tions and different order in the different re- 
sponding documents. This independence of 
order of the search words and the position of 
the corresponding descriptors in the document 
indexes is an important result of the superim- 
posed coding of the search field. 

CONCLUSIONS 

From experience with the Experimental 
MIRF it is concluded that interrogation of the 
magnetic storage units and the over-all control 
of the system can be accomplished with reliable 
circuits of modest complexity. Storage of the 
document index information in wiring associ- 
ated with arrays of cores that are physically 
separable appears feasible ; arrays of cores can 
be separated, submodules of wired information 
can be changed, and the core arrays reassembled 
in a reasonably short time. More work on the 
mechanical design of the magnetic modules is 
needed, however, to permit easier and faster 
changing of the stored information. Based on 
the performance of the experimental model, 
which contained a file of more than 1,000 docu- 
ment indexes, it is concluded that with the pres- 
ent design a system building bloc k should con- 
tain about 5,000 document indexes. It appears 
that as many as ten such building blocks could 
be combined in a system whose over-all control 
is little more complex than that for a single 
building block. Therefore it is concluded that 
files of the order of 50,000 indexes could be 
built with no major changes in the basic con- 
cepts or circuits used in the experimental 
model. 

Easy communication between a human oper- 
ator and the Experimental MIRF System has 
been demonstrated. The machine’s response to 
a search question is essentially instantaneous 
in terms of human reaction time and the in- 
formation content of the response is sufiicient 
to allow the operator to start the document 
search with a general question and to use the 
information received to define a more specific 
question. In this way it is possible to home-in 
quickly on the documents of special interest. 
Several automatic features of the equipment 


have proved to be useful. One of these is the 
capability of accepting a synonym in the search 
question and automatically translating it into 
the synonymous descriptor contained in the 
machine’s vocabulary. Another feature is the 
capability of automatically modifying the 
search question inserted by the human operator 
and initiating a new search. For example, if 
any of the input words have attached to them 
a “see-also” reference, that see-also reference 
will be substituted for the original word to 
form a new search question. 
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RESEARCH IN AUTOMATIC GENERATION 
OF CLASSIFICATION SYSTEMS 


Harold Borko, Ph,D. 
System Development Corporation 
Santa Monica, California 


INTRODUCTION 

This paper is concerned with the organiza- 
tion of information, in the form of documents, 
for efficient storage and retrieval. By docu- 
ments we mean books, technical reports, arti- 
cles, memoranda, letters, photographs, data 
facts, etc. — all forms of memory file organiza- 
tion ranging from documents in a library to 
data in a real-time command-and-control sys- 
tem. Therefore, the implications of this work 
are applicable to a field broader than the con- 
cerns of the ordinary library. 

In actual practice, m.ost of the inform^ation 
retrieval research has been concerned with docu- 
ment files because the most highly organized 
collection of documents in existence today is the 
library, and in doing research on methods of 
organizing information, one must compare the 
adequacy of proposed new techniques with ex- 
isting library m.ethods. Procedures which will 
improve information storage and retrieval in a 
library will probably be sufficiently powerful 
to help improve other methods of file organiza- 
tion. 

PURPOSES OF DOCUMENT 
CLASSIFICATION 

The reason for maintaining a collection of 
documents is to have an available store of in- 
formation and to be able to retrieve desired 
information rapidly and with confidence. The 
value of classification is that it increases effi- 
ciency in locating this desired information. If 
we tried to locate a book on a particular subject 
in a library that did not use any system of 
classification, we would have to spend a long 


time reading the titles and authors of several 
thousand books before we could find the one for 
which we were looking. If we knew the author, 
and the books were arranged alphabetically by 
author, we could locate the book quickly. On 
the other hand, if we didn’t know the author, but 
knew the subject content of the book, we would 
want the books arranged by subject category in 
order to search the file efficiently. Finally, if 
all we knew was that the book we sought was a 
big black one which we could recognize, we 
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being made is that there are various ways of 
organizing a file, and whether or not a particu- 
lar method of file organization is efficient de- 
pends upon the search strategy. Furthermore, 
no one method of file organization would be 
equally efficient for all search questions. This 
is an important, if obvious, point and one which 
is often overlooked. 

The central theoretical problem of classifica- 
tion as a method of organizing documents is 
that only one principle at a time can be utilized 
for gathering items together. This principle 
can be alphabetic arrangement by author, color 
coding based on the binding of the book, sub- 
ject classification, or and other scheme — as 
long as only one principle is used at a time. 

Since a document collection is a store of in- 
formation, it is usually desirable to organize 
this store according to subject matter. By es- 
tablishing clearly demarcated groups, or classes, 
of documents on related topics, the number of 
documents to be scanned can be reduced to rea- 
sonable proportions. This, in essence, is the 
purpose of classification. A classification system 


529 
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is a scheme for organizing a mass of material 
into groups so that related objects are brought 
together in a systematic fashion. Objects in one 
group are selected so as to be more like each 
other than objects in any other group. How- 
ever, before this aim can be realized, two ques- 
tions must be answered : 

1 ) How many classes shall be established ? 

2) What shall be the measure of similarity 
and, hence, what is the principle to be 
used in determining class membership ? 

If one is interested in automatic procedures, one 
has to answer a third question, namely : 

3) What principles of classification are most 
amenable for use in an automated docu- 
ment classification system ? 

All three of these problem areas are being 
studied, and some results are already available. 

DEVISING A CLASSIFICATION 
SCHEDULE 

The classification of knowledge is not a new 
problem. Even in ancient times, man sought to 
organize information of the world around him 
into categories for efficient retrieval. What is 
new, perhaps, is the application of mathemati- 
cal techniques to the classification problem. The 
older forms of classification, from ancient times 
through the Dewey Decimal System, were at- 
tempts to impose logical subdivisions on the 
whole field of knowledge. The surprising thing 
is not that these systems were imperfect, but 
rather that they succeeded as well as they did. 
Melvil Dewey first proposed his Dewey Decimal 
System in 1876, and it is still in extensive use. 
Now, as a result of new inventions and acceler- 
ated research, the traditional boundaries be- 
tween the sciences are breaking down. It is 
time to reexamine the concept of classification, 
to go back to basic principles and to study the 
various methods of deriving a classification 
system. 

Factor Analysis — Borko 

In 1958, Tanimoto ^2 published a theoretical 
paper on the applications of mathematics to the 
problems of classification and prediction. Spe- 
cifically, he pointed out how the problems of 
classification can be formulated in terms of sets 
of attributes and manipulated as matrix func- 


tions. An actual application of matrix mathe- 
matics to the analysis of a collection of docu- 
ments was made by Borko^ in 1961. The aim of 
this study was to determine whether it was 
possible to derive a reasonable classification 
schedule for a collection of documents by factor 
analysis,^ a mathematical technique which en- 
ables one to isolate the underlying variables in 
a domain of events. This method has been used 
by psychologists to determine the underlying 
variables of intelligence, personality, creativity, 
ability, etc. 

In Borko's classification study, factor anal- 
ysis was used to discover the relationship of 
key content words as they are used in psy- 
chological literature. Approximately 600 psy- 
chological abstracts were selected for study. 
These were key punched in their entirety, and 
by means of a computer program called FEAT^ 
(Frequency of Every Allowable Term), a fre- 
quency count was made of all words, and 90 tag 
terms were selected for further analysis. These 
data were arranged in the form of a matrix 
consisting of 90 terms and 618 documents. A 
portion of this matrix is reproduced in Table 1. 
The number in each cell represents the number 
of times a given word occurred in a particular 
document. Table 1 shows that the term ‘‘child 
(children)'^ did not occur in document number 
74, occurred twice in document number 307, 
and three times in document number 374. The 
term “level (s)'" occurred once in document- 
numbers 74, 626, and 674 and did not occur in 
the other documents in the example. 


Words 

Case(s) 

Child (ren) 

Factor (s) 

Level (s) 

Psychology (ical) 

School (s) 

Abstract # 74 

0 

0 

1 

1 

1 

0 

307 

1 

2 

0 

0 

0 

1 

321 

0 

2 

1 

0 

1 

0 

57S 

1 

2 

0 

0 

1 

0 

626 

0 

1 

0 

1 

0 

2 

647 

1 

2 

0 

0 

1 

0 

653 

4 

1 

0 

0 

1 

0 

674 

1 

3 

0 

1 

0 

0 


Table 1. A Portioln of the Document Term Matrix 
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Based upon the data in the document-term 
matrix, one can compute the degree of associa- 
tion among the terms as a function of their oc- 
currence in the same set of documents. A meas- 
ure of this association is the correlation coeffi- 
cient. This is a decimal number which varies 
from +1.000 to — 1.000. A +1.000 would mean 
a perfect correlation, namely, that every time 
word X occurred, word Y appeared in the same 
document; a zero correlation would indicate no 
relationship; and a negative correlation would 
mean that if the' word X occurs in a document, 
then word Y is not likely to occur. 

The formula for computing' the correlation 
coefficient is as follows : 

_ NsXY— (SX) (SY) 

V [N2X-— (5X)-] (SY)=^] 

By applying this formula and computing the 


correlation between each of the 90 words with 
every other word (a total of approximately 
4000 correlations), one creates the term-term 
correlation matrix (Table 2). This matrix ex- 
presses the actual associations which occurred 
among selected words in a sample of documents. 

These statistical procedures, preparatory to 
the factor analysis, are important in demon- 
strating a method for translating a conglomer- 
ation of words and documents into a set of vec- 
tors which can be processed mathematically. 
Factor analysis, when applied to the correla- 
tion matrix, enables one to determine the basic 
underlying variables which account for the rela- 
tions among the words as expressed in the 
vectors. It enables us to mathematically deter- 
mine which words are related and form a set; 
these sets, in turn, are interpreted as classifica- 
tion categories for grouping the original sam- 
ple of documents. 



Ability 

Achieve- 

ment 

Activity 

Analysis 

Anxiety 

AhiHtv 


.272 

—.028 

.048 

.080 

Achievement 

.272 


—.026 

—.041 

.119 

Activity 

—.028 

—.026 


—.002 

—.025 

Analysis 

.048 

—.041 

—.002 


.030 

Anxiety 

.080 

.119 

—.025 

.030 



Table 2. A Portion of the Correlation Matrix 


In the experiment just described, the original 
90-column matrix was reduced to 10 vectors 
which accounted for 62% of the total, and it is 
assumed most all of the common, variance. 
These vectors were then rotated mathematically 
to achieve a simpler and more meaningful struc- 
ture of the hyperspace. They were then inter- 
preted by the investigator as ten classification 
categories into which the original sample of 
618 psychological reports could be grouped — 
and by implication, all psychological literature. 

To illustrate how the factors were inter- 
preted, let us examine the words which had 
significant loadings on the first factor. 


Term # 

Word 

Factor 

Loading 

33 

girls 

.74 

10 

boys 

.73 

70 

school 

.30 

2 

achievement 

.20 

63 

reading 

.18 


There were only five words with significant 
loading. It is fairly obvious that the concept 
underlying these terms deals with the achieve- 
ment of boys and girls in school ; consequently, 
this factor was interpreted as academic achieve- 
ment of boys and girls in school ; consequently, 
in a like manner. These included factors named 
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experimental psychology, social psychology and 
community organization, school guidance and" 
counseling, clinical psychology and psycho- 
therapy, etc. 

Thus, we have arrived at answers to the two 
questions posed earlier in this paper: How 
many classes should be established, and what 
shall be a measure of similarity? The applica- 
tion of factor analysis enables one to determine 
the number of categories which should be es- 
tablished in order to adequately describe a 
given sample of documents. Furthermore, it 
provides a statistical technique, or principle, 
for measuring the similarity of content based 
upon the co-occurrence of key content terms. 

There are many questions still to be answered 
before one can decide on the usefulness of this 
technique for classification. These questions 
include : 

1) Are the categories stable; do they hold 
from one sample of psychological litera- 
ture to another? 

2) Are the categories valid; can all docu- 
ments be reasonably classified into these 
categories? 

3) Are the categories useful; do they lend 
themselves to automated document clas- 
sification ? 

4) Is the technique a general one; can it be 
applied to documents other than psycho- 
logical reports ? 

Before reviewing the studies designed to an- 
swer these questions, it would be well to first 
examine some other mathematical techniques 
for deriving classification schedules. 

Clump Theory — Parker-Rhodes and Needham 

At the Cambridge Language Research Unit in 
England, Parker-Rhodes was also interested in 
classification theory and a mathematical basis 
for forming classes of documents. Interest- 
ingly, he considered the use of factor analysis 
but rejected it on two grounds, one theoretical 
and the other practical. From a theoretical 
point of view, Parker-Rhodes claimed that ‘‘the 
statistical type of technique has its place only 
after we have discovered whatever classification 
there may be. For then it is up to the statis- 


tician to say how nearly the properties of par- 
ticular elements of the universe are inferable 
from a statement of the classes to which each 
belongs. . . . This is quite a different enterprise 
from that of finding the classes themselves''^® 
(page 4) . On the practical side, factor analysis 
is rejected as being incapable of handling 
“really large universes." 

Having decided to avoid the statistical con- 
cept of determining the probability of class 
membership, Parker-Rhodes restructured the 
problem in terms of locating clumps “in a 
Boolean lattice representing all possible subsets 
of the universe." Within a Boolean lattice there 
are many ways of defining clumps, and in 
fact, many different clumps are defined. With- 
out getting involved in details, it can be broadly 
stated that “members of a clump must be more 
like each other, and less like non-members, than 
elements of the universe picked at random" 
(page 9) . Thus we see the relationship between 
the theory of clumps and the theory of clas- 
sification. The method used for locating clumps 
within the lattice remains to be worked out. 
Initial procedures for clumping are described 
by Needham.® Research aimed at improving 
and testing these procedures is still going on. 
However, even now these techniques have been 
applied to a 346 x 346 matrix which is beyond 
the capabilities of presently available factor 
analysis programs. 

Latent Class Analysis — Baker 

The similarity between document classifica- 
tion and the problems inherent in the analysis 
of sociological questionnaire data was recog- 
nized by Baker. He then proposed an informa- 
tion retrieval system based upon Lazarsfeld's 
latent class analysis. ^ As Baker points out, 
“The raw data of documents, the presence or 
absence of key words, is amenable to latent 
class analysis without modification of either the 
analysis or the data. The latent classes and the 
ordering ratios yielded by the analysis provide 
the basis for a straightforward means of classi- 
fication and retrieval of documents" (page 
520). 

The latent class model assumes that the popu- 
lation — that is, the number of documents in the 
sample — can be divided into a number of mutu- 
ally exclusive classes. Usually the number of 
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classes is determined by the investigator, al- 
though it is conceivable that this parameter can 
be determined mathematically. One starts by 
selecting the key words which characterize each 
class of documents. Then latent class analysis 
is used to compute the probability that a docu- 
ment having a certain pattern of key words 
belongs to a given class. 

Baker gives the following example: Let us 
assume that we have 1000 documents in our file. 
We are interested in classifying these docu- 
ments into two classes — ^those dealing with com- 
puter automated instruction and those not 
directly related to this topic. We select as the 
key words in our search request the following : 

1. computer. 

2. automated. 

3. teaching. 

4. devices. 


Each of the 1000 documents are then analyzed 
to determine whether they contain one or more 
of the four terms. Sixteen (2^) response pat- 
terns are possible, ranging from ++++ to 
0000. A test enables one to estimate the latent 
structure from the observed data. Having ob- 
tained a latent structure which fits, one can 
compute an ordering ratio, which is the prob- 
ability that a document having a given word 
pattern belongs to a particular latent class. For 
example, a document with all four key words 
present has a probability of .998 of belonging 
to class 1, i.e., it is concerned with computer 
automated instruction. 

Table 3 shows the relationships between the 
response pattern, expected frequencies, and 
ordering ratios of the 1000 documents analyzed, 
in terms of their latent class structure. 


Response 

Pattern 

Expected 

Frequency 

Total 

Fitted 

Ordering Ratios 

Class 1 

Class 2 

Class 1 

Class 2 

+ + + + 

158.76 

.24 

159.00 

.998 

.002 

+++0 

105.84 

2.16 

108.00 

.980 

.020 

++0 + 

68.04 

.96 

69.00 

.986 

.014 

+0++ 

68.04 

2.16 

70.20 

.969 

.031 

0+++ 

17.64 

.56 

18.20 

.969 

.031 

++0 0 

45.36 

8.78 

54.14 

.838 

.162 

+0 + 0 

45.36 

19.44 

64.80 

.700 

.300 

0++0 

11.76 

5.04 

16.80 

-.700 

.300 

+ 00+ 

29.16 

8.60 

37.76 

.772 

.218 

0+0 + 

7.56 

2.39 

9.95 

.768 

.232 

00++ 

7.56 

5.04 

12.60 

.600 

.400 

+ 000 

19.44 

77.76 

97.20 

.200 

.800 

0+00 

5.04 

20.16 

25.20 

.200 

.800 

00+0 

5.04 

45.36 

50.40 

.100 

.900 

000+ 

3.32 

20.16 

23.48 

.142 

.858 

0000 

2.16 

181.20 

183.36 

.012 

.988 


Table 3. Expected Frequency of Response and the Ordering Ratios Based Upon the Estimated Latent Structure^ 


The table readily reveals the applicability of 
latent class analysis for information retrieval. 
This application is still in the theoretical and 
experimental stages. It has yet to be tested 
with empirical data from actual files. 


AUTOMATED DOCUMENT 
CLASSIFICATION 

The preceding discussions of factor analysis^ 
clump theory, and latent class analysis all dealt 
with methods for devising empirically based 
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classification categories. These and other re- 
searchers have been investigating mathematical 
methods for deriving classification categories 
because of their belief that empirical classifica- 
tion systems will provide a more efficient means 
for the classification and the retrieval of in- 
formation than the traditional methods of docu- 
ment classification. This belief has been sub- 
jected to scientific tests and evaluations. 

In a study by Borko and Bernick,^ an attempt 
was made to test the hypothesis that a classifica- 
tion system derived by factor analysis provides 
the best possible basis for automatic document 
classification and would result in more ac- 
curate automatic classification of documents 
than would be possible using more traditional 
classification categories. Maron,^ in pursuing 
his interests in automatic indexing and clas- 
sification, worked with 405 abstracts of com- 
puter literature which had been published in the 
IRE Transactions on Electronic Computers, 
Volume EC-8. In essence, Maron proposed a set 
of 32 subject categories which he felt were 
logically descriptive of the computer abstracts. 
Then he selected 90 clue words in such a manner 
that they would be good predictors of his 32 
categories. The 405 documents were divided 
into two groups — 260 abstracts made up the 
experimental group and the remaining 145 com- 
prised the validation group. Maron classified 
all 405 documents into the 32 categories. Work- 
ing with the documents of the experimental 
group only, he computed the value of the terms 
in the Bayesian prediction equations. He then 
used this formula to automatically classify the 
documents into their categories. Automatic 
document classification was correct in 84.5% 
of the cases in the experimental group and in 
51.8% of the cases in the validation group. 

Borko and Bernick decided to test the hypoth- 
esis that a higher percentage of correct clas- 
sifications could be made using the same set 
of documents if a factor-analytically derived 
classification system were used instead of 
Maron’s logically derived categories. However, 
their results were approximately the same as 
those obtained by Maron. Because of the nature 
of the experimental design used, it was im- 
possible to determine whether the difficulty lay 
in the mathematically derived classification sys- 


tem or whether the factor score method used to 
predict correct document classification was not 
as effective as the Bayesian prediction equation. 

Another series of experiments were designed 
and executed.^ It was concluded from this series 
that, while there was no significant difference 
between the predictive efficiency of Bayesian 
and factor score methods, automatic document 
classification is enhanced by the use of a factor- 
analytically derived classification schedule. Ap- 
proximately 55% of the documents were auto- 
matically and correctly classified. While this 
55% current automatic classification of the 
documents is statistically very significant, it 
will have little practical significance until 
greater accuracy can be demonstrated. 

Up to this point the criterion for correct clas- 
sification has been the human classifier, but this 
is not necessarily the best criterion. We know 
that humans are not perfectly reliable, and 
therefore, it is not possible to predict human 
classification with perfect accuracy. The ul- 
timate criterion of the usefulness of any in- 
dexing and classification system is whether it 
retrieves relevant information in response to a 
search request. Automatic document classifica- 
tion procedures should be evaluated on how 
efficiently they retrieve information and not on 
how well they can match the imperfect human 
classifier. This is a much more difficult problem, 
but research is already under way to evaluate 
the retrieval effectiveness of automatic docu- 
ment classification. As work progresses on the 
evaluation and improvement of techniques for 
automatic document indexing and classification, 
it can be anticipated that the bottleneck which 
now exists between the collection and the 
processing of documents will be eliminated and 
automated storage and retrieval systems will 
become possible. 
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INTRODUCTION 

Information retrieval, like the weather, 
stimulates a good deal of verbal clamor and 
speculation, yet remains vexingly elusive and 
unmanageable. Hopefully, both the weather 
and recorded inform-ation will eventually prove 
amenable to some form of human control. In the 
meantime, there must be a continuing eifort to 
achieve balance in the evolution of the concept 
and equipment aspects of information storage 
and retrieval (IS&R) . It is reasonable to expect 
overemphasis on equipment capabilities. The 
emergence of IS&R as a distinct discipline is 
largely attributable to the significant advances 

11 i^wiiipuLcx . 

Nevertheless, steps must be taken to correct 
the deficit in systems and concepts, or the situa- 
tion will be analogous to a surveyor pacing off 
chains with a precision micrometer. 

The purpose of this review is to assess 
present capabilities in the field and the extent 
to which these capabilities are effectively uti- 
lized. However, in view of the clamor that has 
already been made about IS&R, the inclination 
here is to avoid a massive item-by-item listing 
of available systems and techniques. Instead, 
general classes of the operations and equip- 
ment involved in IS&R are summarized and 
evaluated. Specific examples are cited for illus- 
tration. 


In anticipation of the rather broad category 
of readers to whom this paper is directed, an 
attempt is made to reasonably satisfy those with 
considerable background in IS&R as well as 
those with less familiarity. Consequently, an 
attempt is made to avoid laborious and detailed 
descriptions of coordinate indexing, storage 
media, etc. At the same time, it is recognized 
that some readers may, for example, be tech- 
nically expert in hardware but require some 
familiarization with the specific software 
notions of IS&R. It is emphasized, therefore, 
that the central aim is to present a critique 
rather than to educate or merely inform. 

These considerations necessitate inclusion of 
the somewhat fundamental (though cursory) 
earlier sections on hierarchic and coordinate 
indexing. They should be ignored or quickly 
scanned by the informed reader. Those who 
have been moderately exposed to the subject 
should be interested in the sections on Prob- 
lems of Coordinate Indexing and, to a lesser 
extent, on Relationships in Coordinate Index- 
ing. The descriptions and evaluations of prob- 
abilistic and automatic indexing should be use- 
ful even to those with extensive documentation 
background. 

The greater portion of the review is devoted 
to a discussion of indexing for this is considered 
to be the most important single factor in IS&R. 


537 
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For example, it is impossible to embark upon an 
exposition of coordinate indexing without 
touching on the coordination problems that de- 
velop at the retrieval end (''false drops” as one 
instance) . Considerable space in the section on 
indexing is in fact dedicated to the problem of 
coordination failures. 

The last part of the section on indexing 
touches lightly on the subject of storage media 
to convey a feeling for the ways in which in- 
dexes are made readily available to the brows- 
ing inquirer. 

The remaining aspects of information han- 
dling — storage and retrieval, abstracting, dis- 
semination, reproduction and display, and com- 
munication links — are treated more summarily. 
Conclusions are presented at the close of the 
review. 


INDEXING 

The schemes for indexing documents are mul- 
tifarious and growing. There are any number 
of ways in which all the existing and presently 
conceivable schemes may be organized, but the 
plan in Figure 1 seems convenient for obviating 
some of the ambiguities and redundancies usu- 
ally encountered. 

Kent^^ gives the following definitions of 
document and aspect systems : 

Document systems may be defined as those 
information systems that involve the record- 
ing of all characteristics concerning a single 
document on one record or, less commonly, 
on a single discrete set of records. 

Aspect systems may be defined as those in- 
formation systems that involve the recording, 



Figure 1. Breakdown of Indexing Schemes. 
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on one record or a discrete set of records, of 
the numbers (or other unique identification) 
of all documents that have a characteristic 
(or aspect) in common. 

In general, indexing systems that locate 
specific information are characterized by 
greater depth of indexing than systems that 
merely locate documents. But increasing the 
depth amplifies the concomitant problems by 
orders of magnitude. Indicative indexing is a 
working reality ; information indexing, at least 
where a large general store of information is 
concerned, is a monster whose powerful poten- 
tial needs to be harnessed. 

Almost all present working systems still rely 
on human indexers, even if all other aspects are 
automated. 

CONVENTIONAL SUBJECT 
CLASSIFICATION 

In conventional subject classification, all 
documents and books are filed in a given slot 
according to an established, predetermined 
hierarchy of subjects. This system is poorly 
suited to a modern technical library for the fol- 
lowing reasons : 

Lookup is a two-step operation involving a 
search for documents under a subject head 
followed by a search of subjects under each 
document to choose those which are per- 
tinent. 

Overlapping of disciplines is increasing. 

The process of interpolating new terms (up- 
dating) in a rapidly developing subject is 
cumbersome. 

A conventional universal classification 
scheme may include the entire store of an 
aerospace-oriented library in one small comer 
of its structure, whereas aerospace may span 
a colossal and varied collection of highly over- 
lapping topics. 

Most hierarchies are artificial. The natural 
structure of biology and chemistry does not 
extend to all knowledge. Ebctensive cross 
referencing alleviates the confinement of one 
heading to a reference, but it burdens the 
user and, carried to extremes, renders chaotic 
the already synthetic hierarchies. 


FACETED SUBJECT CLASSIFICATION 

Faceted classifications attempt to introduce 
greater flexibility by permitting the free com- 
bination of various categories or facets. Sub- 
jects are not arranged in a fixed descending 
order but are combined in an order prescribed 
according to classes. The chief advantage in 
addition to flexibility is that the classification 
may be adapted to the information that is being 
classified. 

Nevertheless, a document is permanently 
pigeonholed, and most of the inherent disad- 
vantages of subject classification are still 
present. 

In fact, faceted classification fails in general 
because it does not solve the problems of the 
searcher. If the hierarchy is not a natural one 
(and most are not), it represents a point of 
view. This system is suitable for well-estab- 
lished subjects and, hence, for the filing of cer- 
tain (e.g., reference) books. Enough time has 
elapsed for the user to have become familiar 
with the indexing point of view. However, the 
outlook of the searcher and the order of the 
index may be completely disparate in consider- 
ing a current, complex subject. 

HIERARCHIES AND RETRIEVAL 
EFFICIENCY 

To better understand the problem of classify- 
ing knowledge in rapidly evolving fields, the 
following example is cited. Even if retrieval is 
100-percent effective, problems still remain. 
Herner and Hemeri® found in a recent study 
involving atomic energy materials that the co- 
incidence between the content of research 
reports and reference questions in atomic 
energy is very small. Further investigation 
revealed that once reports have been used as 
sources of current information, their signifi- 
cance falls away rapidly. These results power- 
fully emphasize the fact that the time available 
for matching viewpoints is essentially zero. 

COORDINATE INDEXING 

A natural consequence of the development of 
high-speed computers was the attempt to find 
new ways of processing data including docu- 
mented information in machine-tractable form. 
One reasonable approach was to question the 
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continuing necessity for creating artificial rela- 
tionships between subjects as in hierarchic 
classifications. Every document defined its own 
subject. How then could the relationships be- 
tween concepts and data be determined by the 
actual environment in which they were em- 
bedded? 

The discussion that follows treats the concept 
of coordination as originally conceived. It pre- 
supposes few or no a priori associations be- 
tween concepts. Certain significant words or 
data are taken from the text of a document, and 
the searcher is then free to combine any number 
of such key terms as he chooses in the attempt to 
match, not viewpoints, but what he actually has 
in mind with what the author is actually talking 
about. This approach is the ideal. The ways in 
which it can be implemented and the extent to 
which the ideal is approached in practice need 
to be examined. The methods of implementa- 
tion and storage techniques and will be dis- 
cussed under that subject. 

The ideal is compromised when associations 
are made between index terms. In a recent issue 
of American Documentation, it is argued that 
the introduction of relationships is tantamount 
to denial of Uniterm indexing and return to 
subject heading classification (attributed to 
Cutter in 1876) Certainly lexical and gram- 
matical ties play an important part in informa- 
tion content. But considering the expense and 
state of the art of linguistic analysis,^ the at- 
titude is that coordinate indexing (no relation- 
ships) and subject heading classification (in- 
flexible with respect to association) are at 
opposite poles; something flexible and capable 
of expressing relationship lies between. 

The general concept of coordinate indexing 
is well understood and will be sketched only 
briefly ; it involves three basic steps : 

1. Accession numbers are assigned to the 
documents in the store. Consider 10 docu- 
ments lettered A to J comprising a closed 
store. 

2. From each document, the indexer picks 
out a representative number of word- 
tokens (single occurrences of a word in a 
document) and assigns a code number to 
each word (a given word distinct from 
any other; it may occur in several docu- 


ments). Assuming emergence of a total 
vocabulary of 30 numbered words, the 
result may be depicted as shown in Tables 
1 and 2, 

Table 1. Documents Ordered by Term Number 


(Serial List) 

Document 

Number 

ABCDEFGHI J 

Words 

(D 3 5 3 16 8 21 7 11 28 

2 6 7 12 17 11 22 @ 8 4 

3 4 @ 1^ 2 19 23 8 2 29 

4 8 14 18 20 13 24 (I) 30 

5 9 9 15 25 @ 

10 15 18 26 

11 19 

27 


In this listing, the 30 words are repre- 
sented by 52 word tokens. 


3. The list is inverted by grouping all docu- 
ment accession numbers under a given 
word. For example, word number 1 (cir- 
cled) occurs under document letters A, C, 
H, I, and J. The inverted list looks like 
this (for the first 10 words) : 


Table 2. Documents Ordered by Term Number 
(Inverted List) 


Word 

Number 

123456789 10 

Document 

Number 

AAAAAB(c)©CC 
©EBBC ®®D 

® I D J F 

I I 

J 


A search defined in terms of words 1, 7, 
and would be found in documents C and H, 
which have these numbers in common. 


Problems of Coordinate Indexing 
Indexer Variability 

The practical difficulties incurred by straight 
coordinate indexing arise primarily because the 
effect of viewpoint can never be eliminated. It 
will occur wherever decisions are made, and 
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indexing at the present stage is largely a deci- 
sion-making process. 

In a superficial experiment conducted at 
Space and Information Systems Division of 
North American Avaition, the IS&R group 
selected a series of documents and compared 
the indexes produced by different people (none 
of whom, however, was a trained indexer) . The 
variations were enormous. Without attempting 
to generalize the few results of this test, the fol- 
lowing trends were noted : 

Those who indexed documents within their 
own specialization tended to read (rather 
than scan) the document and pick many more 
than the average number of terms. 

Those with technical background who had not 
specialized in the field tended to be more selec- 
tive and pick terms that seemed to be em- 
phasized either by frequency of occurrence, 
placement in quotes, or other attention-draw- 
ing devices of the author. 

Nontechnical people tended to pick a fair 
number of terms, chiefly selecting those 
which looked ‘TechnicaF’ and often ignoring 
“common’’ terms with a meaning specialized 
in the particular documents. 

Certain individuals consistently pick few 
terms, others pick many. Citing the most 
extreme example, a document on mathe- 
matics was indexed by one person (a mathe- 
matician) with 52 terms and by another (an 
aeronautical engineer) with 5 terms. 

There was no effort to evaluate document 
relevancy as a function of the resultant in- 
dexes. 

Extensive formal studies of this kind are 
being made at Documentation Incorporated 
(indexing reliability tests and study of teach- 
ing/learning aids)® and IBM (study of effect 
of educational background on encoding per- 
formance).®^ Savage reports no significant 
effect of educational level on ability to index.^® 

Clearly, the human factors in manual index- 
ing need to be explored more fully before effec- 
tive automation of the process can be achieved. 
Specifically, indexing techniques and term selec- 
tion need to be assessed in terms of retrieval 
effectiveness. 


Search Variability — Coordination Failures 

A second problem is the variability with 
which the searcher (or requester) can express 
his inquiry. Often it happens that he is unable 
to express it. A document on germanium rec- 
tifiers might be overlooked in a search for ger- 
manium diodes (near-synonym) or for ger- 
manium transistors (inclusive class). Con- 
ceivably, a requester might even verbalize his 
inquiry as “a germanium thing that permits 
current to flow in one direction but not in the 
other.” 

This problem is one of a group that is cate- 
gorized as coordination failures. These are best 
portrayed by example. Returning to the set of 
documents, A to J, and their filial terms, 1 to 30 
(Table 1), the following specific topics and 
words are assigned : 


Document E 

on : sodium chloride, 
potassium chloride, 
and potassium iodide 


Document G 

on : air-to-ground missiles 


Word no. : 2 — potassium 

16 — sodium 

17 — chloride 

18 — iodide 

21 — air 

22 — ground 

23 — missiles 


Table 3 lists the types of coordination failure 
with examples of searches in which they might 
occur. The examples are purposely trivial for 
the sake of illustration. 

Relationships in Coordinate Indexing 

There are some purists who feel that any 
presumption of relationship between index 
terms is flatly a regression to the construction 
of hierarchies with all of their concomitant 
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vexations, subjectivity, and artificiality. Others 
insist that there must be an ideal marriage of 
coordination and classification to produce op- 
timum communication between the information 
store and its user. Sensibility lies with the 


middle group that comprises not only the 
greatest number of documentation practitioners 
(those actually engaged in meeting the informa- 
tion needs of scientists and engineers) but also 
some of the most advanced thinkers in the field. 


Table 3. Examples of Coordination Failures Using Pure Coordinate Indexing 
(Selection and Juxtaposition Only of Terms Occurring in Text) 


Type of 
Coordination 
Failure 

Example 

Request 

Search 

Terms 

— 

Documents 

Retrieved 

Reason for Failure 

1. False coordi- a) 
nation 

b) 

Ground handling 
of missiles 

Sodium iodide 

Ground (22) 
Missiles (23) 

Sodium (16) 
Iodide (18) 

G (air-to- 
ground 
missiles) 

E (sodium and 
potassium 
ehloride, 
potassium 
iodide, but 
NOT sodium 
chloride) 

Insufficient depth of request — 
easily rectifiable by narrow- 
ing specificity of request. 

Need to show stronger rela- 
tionship than mere co-occur- 
rence or juxtaposition. Possi- 
ble solutions (links, roles, etc.) 
are discussed under relation- 
ships. 

2. Incomplete 
coordination 

Air-to-air 

missiles 

Air (21) 
Missiles (23) 

G (air-to- 
ground 
missiles) 

The search terms are sufficient 
to describe the request but in- 
sufficient to describe docu- 
ment G, which is nevertheless 
retrieved. A meager solution 
is the use of logical negation 
(air AND NOT ground), but 
the requester or searcher must 
be aware of possible exclu- 
sions. 

3. Synonym failure 

Table salt 

Table 

salt 

NONE, but 
document E 
is relevant 

Index does not provide for 
synonym “table salt-sodium 
chloride.” A thesaurus or sub- 
ject authority list is needed. 

4. Generic search 
failure j 

All documents 
on alkali 
halides 

Alkali 

halides 

NONE, but 
document E 
is relevant 

Index does not provide for 
generic name of chemical com- 
pounds, i.e., “alkali halides 
NaCl, KCl, KI.” A thesaurus 
or authority list is required. 

5. False relation- 
ship 

Ground-to- 
air missiles 

Ground (22) 
Air (21) 

G (air-to- 
ground 
missiles) 

Exact matching of terms (i.e., 
no absent terms, no super- 
fluous terms), but a stronger 
relationship than mere co- 
occurrence or juxtaposition 
must be shown. 
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Table 3. Examples of Coordination Failures Using: Pure Coordinate Indexing 
(Selection and Juxtaposition Only of Terms Occurring in Text) (Cont.) 


Type of 
Coordination 
Failure 

Example 

Request 

Search 

Terms 

Documents 

Retrieved 

Reason for Failure 

6. Failure to a) 

retrieve 
“next best" 
documents 

(Refer to 

Table 1) 

3, 6, 4, 31, 32 

NONE. The 
requester, 
however, 
might find 
it advanta- 
geous to 
examine 
document B, 
which has 
three of the 
search terms 
(3, 4, 6) 

No facility for pulling docu- 
ments with at least some of 
the search terms (optimally, 
the most terms) ; lack of 
browsing capability. 

b) 

(Refer to 

Table 1) 

3, 31, 32 

still NONE 

Same problem as above ; 
merely shrinking the request 
does not necessarily improve 
the retrieval. If a searcher 
began a process of successive 
elimination of terms, in the 
given example he could make 

as many as + (^) ~ 

attempts before getting a 
“hit." This would only be 
practical on computers and 
even then is not the most effi- 
cient approach. 


Physical Proximity 

A primitive kind of association between 
words in a text is the distance in terms of words 
that separates them. This association is 
thought by some to be useful in machine trans- 
lation as a word and its dependents tend to be 
grouped together, but it seems to have little ap- 
plication to indexing. It could serve a useful 
analytical function if index terms were tagged 
according to the part of a document in which 
they occurred, such as title, abstract, and main 
body. The relative importance of these parts 
could then be analyzed. A related study has 
been made by workers at IBM. Resnick^® and 
Savage report that the results of indexing from 
an abstract and from the title of a document 


are not significantly different. It should be 
realized, however, that the IBM indexing 
studies are based on a comparison of indexing 
terms with keywords supplied by users of a 
Selective Dissemination of Information (SDI) 
System. A certain bias is thus already inherent 
in the tests. 

Synonyms 

Words with similar meanings can be handled 
in the input by using “see'' references. In a 
descriptor system, all words considered near 
enough in meaning are mapped into a single 
term (descriptor) via a subject authority list or 
they are referred to a thesaurus. The process 
can be done by the human searchers or, prefer- 







544 proceedings— SPRING JOINT COMPUTER CONFERENCE, 1964 


ably, can be handled entirely within the system 
dictionary. If the system is to serve its users 
with a minimum of inconvenience, the con- 
straints imposed on the users and indexers must 
be reduced to a minimum. Humans should be 
free to make decisions (formulate requests, 
select keywords). The system should unburden 
them of the processing and matching opera- 
tions. 

Partial Implication or Near-Synonyms 

This problem is much like the preceding one 
except that now provision is made for generic 
groupings of specific terms and specific filials of 
generic terms. These are usually handled by 
‘‘see also” references. In other words, some 
documents are filed under the given term but 
other potentially relevant documents may be 
found under related terms. This technique in- 
corporates into the system the main advantages 
of hierarchic arrangements. 

Semantic Relationships 

As a rule, the association of index terms 
should not be stressed too heavily. It is some- 
times helpful to show directional relationships 
in answer to the question “Does A affect B, or 
does B affect A?” For example, the terms “dif- 
fraction” and “ultrasonic” could pertain either 
to ultrasound used to diffract light or to the dif- 
fraction of ultrasound. Even the expression 
“utrasonic diffraction grating” can be ambigu- 
ous in this respect. 

The occurrence now and then of such false or 
ambiguous correlations led to the creation of 
associative links that connect various terms 
from a document in one group, other terms from 
the document in another group, and so on. To 
further clarify the meanings involved, semantic 
roles were devised to delineate the function of 
a given term within a given document.^^ 
foregoing ultrasonic diffraction grating prob- 
lem could be handled nicely by assigning “ultra- 
sonic” a role indicator telling whether it is 
“acted upon” (by an acoustic grating) or “act- 
ing upon something” (light). 

A present working system that uses roles and 
links extensively is the one serving the Ameri- 
can Institute of Chemical Engineers.^® How- 
ever, such devices are best suited to such highly 
structured disciplines as chemistry since it is 


important for the chemist if he is interested in 
a given chemical as a catalyst not to be bothered 
with documents on this chemical as a reagent, 
dye indicator, or insecticide. 

Another device is to join words inseparably 
as bounds terms (precoordination). A noun 
and its descriptive adjective might be treated 
as a single concept. This has both good and bad 
consequences. It requires a well-tutored in- 
dexer and means that a search for one of the 
terms will not, in general, pull a document in- 
dexed with the bound term. In particular, how- 
ever, it is useful for concepts such as chemical 
compounds (especially in a chemical library) 
or the occasional ambiguous term like air-to- 
ground missiles. It is beneficial for pairs of 
terms that always coordinate. 

Relationships between words are invoked in 
the hope of decreasing false coordinations. But 
an exaggeration of their importance is not de- 
sirable, for they can simultaneously exclude 
relevant documents. A good system will com- 
bine the various types of coordinate indexing, 
avoiding extremes but using relationships spar- 
ingly. 

Probabilistic Indexing 

Indexing, like searching, may be thought of 
as a binary operation. Every word in the text 
of a document is either chosen as an index term 
or it is not. In a search, a document is either 
picked (assumed relevant) or it is not. This 
means that when a request is made, the user has 
reason to be wary of the output. He may get a 
handful of documents, most of which will hope- 
fully be relevant to his request. Or he may be 
handed a list of several hundred accession num- 
bers. He may then undertake the project of 
scanning the documents or narrow his request. 
Then he may exclude documents of interest. All 
this because the system called every document 
either good or no good. 

Probabilistic indexing, proposed by Maron 
and Kuhns,24 provides for the assignment of 
weights to index terms and the ranking of re- 
trieved documents according to some criterion 
of relevance. Instead of being chosen on a zero- 
one, go or no-go, basis, the words of a document 
are weighted by the indexer according to their 
significance to the documents. Maron and 
Kuhns, in their own experimental work, use an 
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eight-level rating, i.e., 0, Vg, 1. 

Relationships are also introduced, but on a 
purely statistical basis relying on the frequency 
of occurrence of word pairs in all the documents 
of the collection. The most promising one of 
several mathematical devices used to relate the 
word pairs is a coefficient of association. 

The coefficient of association is, as described 
by the authors, essentially a measure of the 
excess of joint occurrences of a given pair over 
expectation based on random co-occurrence. 

An alternative to Maron’s ‘‘excess'’ was pre- 
sented by Stiles at the recent symposium on 
materials information retrieval attended by one 
of the authors.^^ The chi-square test used to 
correlate physical experimental data is applied 
to word co-occurrences. 

Probabilistic Indexing Experiment 

A controlled experiment was conducted with 
110 articles, selecting and categorizing key- 
word^ then working backward to coordinate 
documents and categories. In this way, eventu- 
ally, for every request there was an answer 
document which, when retrieved, satisfied that 
request. Conventional (binary) and probabilis- 
tic techniques were compared, and the results 
were expressed in terms of the number of re- 
trieved documents that had to be read before 
hitting the answer documents. 

It turned out that the conventional system 
would require the user to read approximately 
thirty percent more retrieved documents to 
obtain the same number of answer documents 
as opposed to the basic selection process of 
probabilistic indexing. Considerable improve- 
ment was achieved with elaboration of the basic 
selection process. 

Proba bilistic Indexing — Critique 

Indexer subjectivity is greatly increased. The 
decision-making process would seem more diffi- 
cult, but it may subsequently prove to be worth 
it. According to the authors, the work of the 
indexer is facilitated; as it now stands, he is 
too constrained by having to say yes or no to a 
possible index term. 

The ability to make too fine a breakdown of 
the weighting scale from 0 to 1 is questioned. 
An analogous problem arises in education when 


a teacher has to grade pupils in a highly sub- 
jective field. In probabilistic indexing a scale 
of 0 to 3 (or 0, Vsy 1) would seem to be 
the maximum. However, the psychometric 
problem of scaling is not an easy one to resolve. 

The premise that one can go from index 
terms that are highly significant of a document 
to documents that are highly relevant to a 
request is accepted with much reservation. It 
is probably a correct premise in most instances, 
but it overlooks the situation in which a request 
will be quite well and perhaps best satisfied by a 
document in whose over-all text the search 
terms play a relatively minor role. 

The old subject-classification problem — ^to 
describe the subject of a document — does not 
wholly describe its information content. 

There are many formulations of the informa- 
tion retrieval problem in terms of statistical and 
information theory, but most of them skirt the 
problem. In fact, the circumstances are often 
tantamount to a search for the exact solution 
to a problem when most people are not sure just 
what the problem is. 

The probabilistic indexing approach, on the 
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eventual over-all mechanization of IS&R. This, 
Maron's later work on automatic indexing and 
Edmundson and Wyllys' insight into the true 
implication of information theory for IS&R, 
seem to be paths to the library of the future. 

Probabilistic indexing is still a manual index- 
ing procedure. It appears to be the natural 
transition from manual to automatic indexing 
because as mechanized statistical methods of 
selecting terms from a document come more and 
more into use, probabilistic indexing will pro- 
vide a richer foundation on which to build. 

MECHANICAL INDEXING 

Any discussion of mechanized indexing should 
be prefaced by citing the single major limita- 
tion that prevents any of the elegantly devised 
theoretical schemes from being put into prac- 
tice. This is the requirement for total text 
input. The machine processing of total text is 
not efficient for anything but experimental 
analysis as long as printed text must be trans- 
lated into machine-readable text (via key- 
punching, for example). Working automatic 
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indexing systems, like machine translation sys- 
tems, must await the perfection of print-read- 
ing and character-recognition devices. 

Mechanized indexing is attacked from two 
different angles, statistically and linguistically. 

Statistical 

The common basic assumption of statistical 
selection processes is that the linguistic prop- 
erties and laws of literature are describable in 
terms of populations and frequencies of word 
occurrences and their associations. The diifer- 
ence between statistical description of language 
and statistical thermodynamics is basically 
quantitative, but this quantitative difference is 
sufficient to manifest itself in a qualitative form. 
In general, the numbers treated by thermody- 
namics are larger and the variables fewer than 
in linguistics. The result is that given a norm 
of behavior in linguistics, the deviations from 
the norm are more pronounced than in thermo- 
dynamics. There is little evidence in the physi- 
cal world of less than maximally probable 
events. In language, on the other hand, excep- 
tional occurrences are witnessed regularly. 

The problem confronting mathematical lin- 
guistics, then, is to define the variables and to 



Figure 2. Idealized Frequency Count of Words Oc- 
curring in a Document Entitled “The Ultrasonic Ac- 
celeration of Diffusion.” 


refine the statistical behavior laws narrowing 
their specification and reducing the undefinable 
to a minimum. 

The fundamental concept of statistical key 
word selection is illustrated in Figure 2, which 
is designed to convey a graphic impression of 
the concept and is not necessarily the only pro- 
cedure for implementing the concept. The dia- 
gram represents a smoothed frequency count of 
the word occurrences in a document as a func- 
tion of the rank order of the words. This is 
statistical selection in its crudest, most unre- 
fined form. It flatly presupposes that common 
words will occur most frequently, the esoteric 
but unpertinent words most infrequently. The 
middle-frequency words — those remaining after 
the extremes of the curve have been removed — 
are considered to be the most significant. The 
same process can then be applied to pairs of the 
significant words. 

In general, this first-approximation procedure 
screens out the many clearly irrelevant words 
leaving a working body that can be treated by 
more precise techniques. 

Examples of Statistical Selection 

Three proposals for automatic, statistically 
based indexing merit attention. One has many 
ramifications in use today; the other two are 
still in the ivory tower having been subjected 
only to carefully controlled experiments. 

Auto-encoding 

The pioneer study of mechanized index- 
ing, 22.23 which led to the key word in context 
(KWIC) index, relies on the computer recogni- 
tion of individual words and counts their fre- 
quency in a text. Eliminating the common 
words, the most commonly occurring topical 
words are used (without further discrimina- 
tion) as index terms. The product is a mechani- 
cally prepared concordance. 

Recognizing the limitations of word indexing, 
Luhn has standardized the vocabulary by com- 
bining words with the same root and then 
combining the counts of synonymous words. A 
normalized form is selected after being looked 
up in a thesaurus. Relationships are handled 
by computer analysis of word pairs in cases 
where significant words occur together. 
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The search terms are developed analogously 
from an essay-form request by matching search 
terms and index entries. 

Automatic Document Classification 

Academic-level arguments to the contrary 
notwithstanding, there is much to be gained by 
a scheme for rapidly and automatically fitting 
documents into sensible categories. The prob- 
lem facing library practitioners is particularly 
acute in this respect, because while those slightly 
removed from the bookshelves debate the com- 
parative merits of classification versus docu- 
ment-independent organization of information, 
these people are pressed to shelve their docu- 
ments in a rational fashion. Besides, as Borko 
argued so convincingly at a recent linguistic 
seminar,® people (scientists included) tend to 
think naturally and with facility in such an 
ordered way, i.e., categorically. The conflicts 
arise when the flexible adaptive human psycho- 
logical system confronts rigid, long-standing 
systems solidified on paper. 

A happy solution appears forthcoming in au- 
tomatic classification. Work has been done in 
this area by Maron^^ and Borko et al.® ® In 
both instances the researchers rely on statistical 
correlation between words and documents, and 
between words. The two experiments are simi- 
lar in several respects. For example, documents 
are searched by computer for the occurrence of 
specific clue words relating to specific a priori 
subject heads, or categories. The essential dif- 
ference lies in the derivation of the categories 
and assignation of the documents. Maron de- 
vises his own categories and assigns the docu- 
ments by means of a Bayesian prediction 
formula. Borko, on the other hand, derives his 
categories by matrix correlation of the clue 
words, extraction of eigenvectors by factor 
analysis, and rotation of these eigenvectors to 
find the best categories. Documents are then 
assigned by factor score prediction techniques. 

Borko has done considerable work in the last 
couple of years in comparing his own and 
Maron’s work. A related effort in document 
association by linguistic analysis has been un- 
dertaken by Salton at Harvard.^^ 

Automatic Indexing (Maron) 

Maron’s scheme^® for automatic indexing 
operates by generating a priori subject heads 


and searching documents by computer for the 
occurrence of these subject heads (called clue 
words). The intention is a good one, to ulti- 
mately merge probabilistic and automatic in- 
dexing. 

Automatic Indexing (Edmundson and Wyllys) 
— Relative Frequency Approach 

This technique^® represents a new concept 
in the statistical analysis of text and comes 
closest to bridging the gap between IS&R and 
information theory. Instead of treating a docu- 
ment as the universe of words as in the primi- 
tive diagram in Figure 2, the frequency of a 
word in a document is compared with the 
frequency of the same word in general use. 
Various statistical criteria are suggested and 
evaluated. The upshot is that a word that is 
rare in general use but frequent in a document 
is a significant measure of that document’s con- 
tent. This corresponds to the surprise element 
of information theory. 

Citation Indexing 

Several studies have been made in recent 
years using the bibliographic materials and 
Citations contained in documents either directly 
for cross-referencing purposes or as a statisti- 
cal vehicle.'2’‘^'2o,33 These techniques exploit the 
dynamical historical evolution of science and 
the language used to express it. One of the 
authors (Wood) is working on interlingual 
citation statistics in Russian and English acous- 
tics journals in an attempt to derive systemati- 
cally from current terminology an accurate 
technical terminology for machine translation 
and information retrieval from foreign lan- 
guage references. 

Linguistic 

The creation of indexes by automatic linguis- 
tic analysis penetrates deeply into the realm of 
machine translation (MT). In this case, it calls 
for the automatic translation of the source 
language into a target IS&R language from 
which indexes can be created. Present MT ef- 
forts are aimed at syntactic analysis, of which 
there are two types : 

1. Immediate-constituent or phrase-struc- 
ture analysis — the segmentation of sen- 
tences into successively smaller machine- 
tractable parts. 
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2. Dependency analysis — ^the assignment of 
a governing word to every word in a sen- 
tence (or other lexical unit), i.e., the cre- 
ation of a complete dependency structure. 

Further elaboration of linguistic analysis is 
beyond the scope of the present survey. A clear 
description and reconciliation of the two ap- 
proaches are given by Hays.^^ Klein and Sim- 
mons actually implement both procedures simul- 
taneously Finally, a thorough state-of-the- 
art treatment of syntactical analysis was pre- 
sented by Bobrow at the last Joint Computer 
Conference.^ 

Extension of Concepts 

A desirable future system would unify auto- 
matic statistical indexing applying the relative- 
frequency approach of Edmundson and Wyllys 
as a criterion for assigning weights to index 
terms as required in probabilistic indexing. 

A certain amount of skepticism is evoked by 
the assumption that statistical selection will 
create the best indexes because some of the 
problems are semantic. However, it will prob- 
ably create the best automatic indexes for some 
time to come. 

Linguistic analysis has a serious drawback 
if used to find a common language for IS&R and 
MT in that there is a major difference of goals 
between IS&R and MT. In MT, the user sup- 
posedly has a document, wants to know what it 
says, and is therefore interested in fidelity of 
translation. In IS&R, the user is interested in 
finding a peculiar (sic) type of information and 
wants the documents that contain it regardless 
of how it is presented. On the other hand, 
linguistic analysis will serve IS&R advantage- 
ously if used to learn how people think and 
communicate ideas. 

An intermediate step to probabilistic index- 
ing would be indexing on two levels, viz., terms 
which are indicative of what a document is 
about and other terms pertaining to items of 
information not immediately contained in the 
topic. In other words, the following assigna- 
tion of values would be made to every word in 
the document : 

0 — Nonsignificant word 

1 — Significant nonindicative word 

2 — Significant indicative word 


Indicative is interpreted to mean that the word 
is directly related to the theme of the document. 
Such distinctions are psychologically facile for 
human indexers. 

STORAGE AND RETRIEVAL 
PRESENTATION OF INDEXES 

If an automatic literature search is conducted, 
the user will, in general, never see the index. 
He will make a request, and the system (via 
librarian or computer) will do the search for 
him. 

There still remains a strong need for the user, 
on occasion, to see the index. The forms of 
index presentation, all of which can be readily 
produced by machine, are card form, book form, 
and pictorial display. 

Card Form 

Every library user is familiar with the stand- 
ard card catalog. The cards may represent units 
of a document file, in which case one card con- 
tains all the printed information pertinent to 
a document. They may be units of an aspect 
system, each card representing an index term 
and containing a list of document numbers. 
They can also be punched or otherwise marked 
for manual or machine searching, and they may 
have a window with a microfilm of the docu- 
ment or abstract (aperture card) . 

Book Farm 

Conventional and Rotated 

Book-form indexes stem from the familiar 
list of topics at the back of almost every non- 
fiction book. The frustration often engendered 
by most book indexes led to improvements, such 
as the extremely detailed and cross-referenced 
index of Chemical Abstracts. An advanced de- 
velopment is the rotated, or permuted, index. 
Strings of key words, such as the significant 
words of titles or auto-abstracted sentences 
(KWIC), are listed several times, alphabeti- 
cally according to each word in every such 
string. Many experimental machine-generated 
index printouts of this type are now in use and 
are proving to be very effective. For example, 
the RotaForm Index, now a regular part of 
Index Chemicus, is a computer-generated mo- 
lecular formula index that repeats and rotates 
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a chemical formula for every chemical element 
that it contains. 

Columnar 

An attempt to present coordinate indexes of 
the Uniterm type in book form is the columnar 
index, such as the Scan-Column Index and 
Tabledex. These indexes print adjacent col- 
umns of document numbers under word num- 
bers and are rapidly scanned to locate docu- 
ments with a large number of desired key 
words. The limitations for an untrained user 
are pretty obvious. Clearly, the fatigue factor 
would be high. 

The book-form index is, of course, a closed 
index; once it is printed, terms cannot be en- 
tered or deleted. 

Pictorial Display 

The ability to visualize a coordinate index in 
Gestalt form is much to be desired. It has the 
added advantage that it can, in principle, be 
viewed on the same screen as the document or 
abstract itself. 

Termatrex and Minimatrex 

Conceivably, one might project any ^ book- 
form index on a screen. But the displayed in- 
dex used to great advantage by a number of 
firms today is the Termatrex system. Document 
accession numbers are located on the nodes of a 
coordinate grid that is set against an illumi- 
nated background. When a number of Terma- 
trex cards, each one representing a key word, 
are superimposed, the light will show through 
the nodes that are common to all of the key 
words. In this way, the user can get a rapid 
over-all glimpse of how many documents he 
would have to inspect as well as their accession 
numbers. 

Also under development by Jonker Business 
Machines, the creators of Termatrex, is Mini- 
matrex, an improved, miniaturized, mechaniza- 
ble version of Termatrex. The coordinate grids 
are photographed on film strips by grouping 
broad categories of terms onto corresponding 
frames of the strip. In its present form, the 
Minimatrex strips have 8 frames, each one a 
lOO-by-100 matrix representing 10,000 docu- 
ments. As many as 12 strips can be superim- 
posed, and the resultant retrieval pattern is 


projected on a viewing screen. The number of 
frames and superimposable strips is expandable. 

Hypothetical Systems — 

The Semantic Road Map 

There is one hypothesized system which, al- 
though far too advanced conceptually to be op- 
erational even in the near future, deserves 
special attention because it foresees the possi- 
bilities of a truly automated era. Loren Doyle 
of SDC^ gives an intriguing picture of a 
space-age library system. Giving credit to the 
relative-frequency notion of Edmundson and 
Wyllys,!^ Doyle seeks to '‘increase the mental 
contact between the reader and the information 
stored so the reader can proceed unerringly 
and swiftly to identify and receive the message 
for which he is looking.’’ 

The semantic road map idea recognizes the 
human proclivity to organize concepts in spatial 
relationship to each other to allow the free 
evolution of concepts, one as the outgrowth of 
another, linking the retrieval process more in- 
timately with the process of natural creative 
thought. 

Doyle provides a scheme analogous to the one 
by which we would find an apartment number, 
given an address number, street, zone number, 
town, county, state, and country, none of which 
were familiar to us. We would begin with a 
reasonably compact map of the world and pro- 
ceed until we had a houseplan of the apartment 
building. 

The projected library user sits before a con- 
sole with a panel of control buttons and a view- 
ing screen. He begins with a map of the con- 
tents of the library whereon he views concepts 
whose mutual associations are indicated by 
proximity, heavy lines, thin lines, dotted lines, 
large letters, small letters, and arrows that 
point toward distant cousins. Through a rapid 
succession of such maps, he homes in on the 
message for which he seeks. 

The documents themselves are in the form 
of "document proxies,” or abstracts in map rep- 
resentation similar to the search maps. This 
permits rapid scanning of a large number of 
document abstracts to find those worth reading. 

The maps are, of course, automatically gen- 
erated. One set of techniques for realizing such 
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a system is given along with the description of 
a preliminary experiment. 

Don Swanson,^® Dean of the Graduate Li- 
brary School, University of Chicago, gave a 
somewhat similar man-machine concept for a 
library request console tied into a storage and 
control system. However, his discussion, unlike 
Doyle’s, was centered on console-library request 
techniques as a feedback link which is part of an 
on-line search process system for future auto- 
mated library systems. 

Savage emphasizes that to describe the effec- 
tiveness of a system in terms of magnitudes and 
system parameters (number of books in a li- 
brary or number of descriptors in a computer 
memory) does not tell how well the system 
works. What is needed is a standard criterion 
of relevancy, one that is as useful as miles 'per 
gallon for measuring automobile economy. 

Savage characterizes human evaluations in a 
somewhat more precise way than is usual, 
points to some of tfie difficulties of ever generat- 
ing a metric measure of system performance, 
and suggests the direction of study to obtain 
more accurate measures. 

It is important to realize that any metric cri- 
terion of adequacy will have to be tested by way 
of obtaining feedback froA the'users of a real 
system. * ‘ 

MECHANICAL SEARCHES 

The original mechanized searches were made 
by serially operating machines, which operate 
most efficiently by reading the whole store to 
find what is needed. Serial searching is still 
advantageous when the store can be broken up 
and grouped (as in multiple-punched card 
decks) ; input, updating, and purging are also 
simpler then. 

By its very nature, however, serial searching 
is less efficient than applying the lookup prin- 
ciple, which is feasible with random access 
machines having large memories. This princi- 
ple also allows for indefinite expansion of a 
large file. 

SERIAL AND RANDOM ACCESS 

The factors which must be considered in a 
comparative evaluation of serial and random 


access techniques (search and lookup) are file 
size, length of search, economy of money and 
time, effort expended, and the inherent effi- 
ciencies of retrieving a given type of informa- 
tion. 

In the face of the growing number of avail- 
able computer systems, the thorough-going and 
detailed evaluation of any systems or combina- 
tions of equipment presents an enormous task. 
An interesting and original approach to the 
problem by automation of the selection of sys- 
tems suitable to a particular need is given by 
Arnovick.i Essentially, a complete list of sys- 
tem component characteristics and ancillary in- 
formation is generated and stored in a com- 
puter. A complete set of application require- 
ments is compiled and matched against the 
system information stored in the computer ; the 
computer analyzes the application information 
and synthesizes ^n optimum general system. 

LOGICAL OPERATIONS 

It is convenient to think of searches as com- 
prising two kinds of logical operation, conjunc- 
tive and disjunctive. Searches are narrowed by 
the former because taking the logical products 
and differences of terms (AND, AND NOT) 
makes the request more specific. Disjunctive 
operations are useful for broadening the search ; 
the logical sum (OR) makes for greater varia- 
bility. Disjunctive operations and combinations 
of disjunction with conjunction (logical prod- 
ucts of logical sums, etc.) are especially helpful 
for processing request terms in a machine- 
stored thesaurus or otherwise pinpointing a 
request. 

It has been pointed out that although Boolean 
functions do, in fact, accomplish the stated 
effects, from the systems optimization stand- 
point, they do not do so efficiently. It is demon- 
strated that Boolean combinations will yield 
either too much irrelevant material or too little 
relevant material.^^ 

In the attempt to find a more flexible and 
optional treatment of the retrieval problem us- 
ing keywords, Del Ballard (formerly of RCA) 
has developed a new approach to search logic. 
The technique is one that is easily implemented 
on a general-purpose computer and provides 
the following : 
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1. Retrieval of ''best available’' documents 

2. Ranking of documents by the degree to 
v^hich they match a request in terms of 
number of keywords 

3. Upper and lower bounds on either the 
number of keywords or number of docu- 
ments 

4. Sensitivity to various attributes of key- 
words, e.g., rank, order, numerical mag- 
nitude, ranges of values 

5. Arrangements of search terms into groups 
that fit the requester’s thought process, in- 
cluding nesting of groups 

It is stressed that the grammatical problem is 
circumvented by grouping concepts without 
specification of the relationships between them. 
It has proven to be a pliable, rough, and ready 
tool suitable for the general practitioner in in- 
formation retrieval. 

MANUAL — DEDICATION OF SPACE 

A systematically arranged index can be 
searched by proceeding according to the rule of 
systematization as in the standard card or book 
catalog. The rule may be simple, as in alpha- 
betic filing, or it may rely on a certain amount 
of prior knowledge on the part of the searcher 
as in chemical and biological hierarchies. In 
more recent developments, such as the permuted 
index and manual coordinate indexing cards, 
the searcher is guided more rapidly to a specific 
location of information. 

A technique that is both efficient and adapta- 
ble for eventual machine searching makes use 
of the Dedication of Space principle. In essence, 
a particular geometric point or region is used 
to represent an accession number. The match- 
ing of like points for several terms yields docu- 
ments containing those terms. This is a form 
of the coordination principle. The best known 
example is the peek-a-boo card, one version of 
which is a standard punched card of the IBM 
type coded so that each space represents a docu- 
ment number. Matching is achieved by aligning 
several term cards and noting the document 
numbers (holes) through which light can be 
seen. If no documents are found in a search, 
terms can be eliminated by trial and error until 
a "see-through” occurs. In computer searches, 


it is possible to generate a "minimum search 
criterion” which, for a given set of terms, will 
yield those documents containing the largest 
number of terms in the set. 

STORAGE MEDIA 
Paper Media 

Paper holds a certain record for longevity as 
the prime medium for recording information. 
That it can be fairly permanent has been dem- 
onstrated by the Dead Sea scrolls. It will con- 
tinue to be prominent for some time, although 
in recent decades it has taken unprecedented 
forms, such as punched cards and paper tape. 

An interesting speculation presents itself. 
Although digital information can be stored on 
paper, the technique of sensitization does not 
make use of the intrinsic properties of the me- 
dium, i.e., holes are punched or magnetic ma- 
terials are deposited. Researchers in the Soviet 
Union have uncovered a slight piezoelectric 
sensitivity in wood which may indicate possi- 
bilities for the use of paper. 

In general, cards and paper tape are used 
either as storage media for small manual or 
semiautomated systems or as the input miedium 
for large automated systems. Punched cards 
and punched paper tape executed by automatic 
writing equipment are used for the latter. Small 
systems use peek-a-boo cards, keysort cards, 
edge-notched cards, Uniterm cards, or modifica- 
tions of these. 

Magnetic Media 
Tapes 

Magnetic tape is extremely well suited to the 
long-time storage of large quantities of data in 
machine-language form. It is also one of the 
most rapid search media. For this reason, it is 
used most extensively as the peripheral mem- 
ory of computers and, therefore, for the storage 
of older, less frequently needed information. Its 
nature is generally more permanent and less 
flexible than other magnetic media. 

Because it is most adaptable to digital data, 
magnetic tape usually works best for the stor- 
age of index terms and document numbers. 
There are times when it is advantageous, ac- 



552 PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 


cording to some people, to record entire text in 
machine language on tape ; but this is not con- 
ceivably practical except for data. However, 
there is a system called VIDEOFILE (an RCA 
development) which stores documentary and 
pertinent bibliographic information using video 
techniques. The document image is converted 
to a video signal and is stored with an index 
number. It has the advantage of furnishing 
rapid and direct hard-copy printout and is 
amenable to communication media. 

Drums 

Magnetic drums are better suited than tapes 
for the internal memory of a computer because 
of their more rapid (many orders of magni- 
tude) random access. Being a lookup device, 
the drum is well adapted to storage of the in- 
ternal functions of a computer, such as dic- 
tionaries, thesauri, and program instructions. 
Moreover, it is the cheapest type of internal, or 
main, memory. 

Discs 

Another dimension is added by the use of disc 
memory units, which have characteristics simi- 
lar to drums but with a considerable gain in 
access time and storage space. Given an ad- 
dress, the information is located on a certain 
disc in the stack, a certain sector of the disc, and 
a certain band within the sector. 

Magnetic Cores 

Magnetic core arrays provide still faster re- 
trieval, but the expense per unit storage space 
is the highest of all internal memories. Cores 
work well in conjunction with magnetic tape 
as the peripheral storage media. 

Cards 

Magnetic cards need to be searched serially, 
but they have the advantage of flexibility over 
tape with respect to additions or deletions. 
However, the search rates are slower. The cards 
are usually stored in cartridges and may be 
used in conjunction with film chips, in which 
case the identifying codes are in magnetic digi- 
tal form whereas the information is in the form 
of a photographic image. Magnetic coding is 
sometimes used in conjunction with cards con- 
taining the printed information. 


Optical Media 
Microfilm Reels 

Most of the forms of magnetic media have 
their photographic analogs and the same com- 
parative advantages. Photographic media have 
the main virtue of storing information in a 
form ready for immediate availability (docu- 
ment images). This is particularly suitable for 
drawings, abstracts, and texts which need not 
be read by machines but are accompanied by 
machine-readable codes. 

Like magnetic tape, continuous microfilm 
must be scanned serially and is not easily up- 
dated or otherwise altered except by the cum- 
bersome process of editing and splicing. 

In one way, film is less flexible even than 
magnetic tape because it lacks the erasing fea- 
ture. However, for storage of nondigital mate- 
rials to be displayed and read by people (as 
opposed to processable data), this slight differ- 
ence is more than offset by the possible elimina- 
tion of the middle step of the sequence: (re- 
quest)-^ (index storage) (document storage). 
The last two steps can be combined when film 
is used. 

Microfilm Strips 

A natural way to introduce flexibility into 
microfilm storage is to cut the film into short 
strips. This approach is used in the nonconven- 
tional application of Minimatrex, a system for 
the projection of a peek-a-boo type coordinate 
index onto a screen. The system is discussed 
under Indexing, Presentation of Indexes. 

Microfilm Chips 

The usual technique for breaking up micro- 
film is to store separate frames containing, as a 
rule, from one to four reduced document-image 
pages. The chips are generally stored in car- 
tridges. The fully automatic handling of such 
devices is expensive, but a compromise is often 
effected by providing efficient manual retrieval 
of the cartridges and automatic retrieval of 
the chips. 

Aperture Cards 

More than satisfactory results have been ob- 
tained in some cases by combining film with 
other types of media. For example, film chips 
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containing document images can be inserted in 
magnetically coded cards (Magnacard system), 
providing a compact and flexible file-storage 
complex. 

Another highly efficient combination is film 
and paper, i.e., punched cards of the IBM type 
with microfilm inserts. The cards are compara- 
tively inexpensive; search is relatively swift; 
the file is flexible (easily updated and purged) ; 
and it affords all the convenience of having the 
index, document store, and retrieval system in 
one location. 

Specialty Devices 

IBM is developing a photoscopic disc capable 
of storing up to a billion bits. The speed of 
access is about the same as that of magnetic 
drums, but the large capacity makes it applica- 
ble for the kind of large storage lookup required 
in the processing of natural languages and ma- 
chine translation. There is an erasure problem, 
which is minimized by coupling with a smaller 
auxiliary magnetic drum. 

MIT is developing the Photomemory, a com- 
plex of 4- by 5-inch photographic plates, each 
of which can store 5 million bits of information. 
Reports on access time are inconsistent; gen- 
erally, it appears to be a medium access time 
device. Access is sequential, density is high, 
and readout is rapid. The photographic plate is 
scanned by a rotating mirror which projects the 
image past a bank of photosensitive cells. 

An electron optical device based on an effect 
called thermoplastic recording is under devel- 
opm.ent. Still in the laboratory stage, it makes 
use of the properties of electron beams which 
have low deflection inertia, high energy, and — 
most important — high resolution. Information 
is stored as a pattern of electric charges, the 
field of which deforms the thermoplastic sur- 
face of a film upon subsequent heating. This 
means that signals can be stored with a resolu- 
tion much less than the wavelength of visible 
light. To cope with the amplitude-variation 
rather than density-variation form of the re- 
corded information (surface ripples), an opti- 
cal schlieren system is used to read the image. 
The potential impact of this very advanced de- 
vice on ISR is not yet evident. 

Other devices, intended for use primarily in 
associative, or content-addressed, memories are 


neuristors, cryotons, and fiber-optical devices. 
Associative memories tend to be slower in arith- 
metic operation than in conventional storage 
units. However, because of the capability of 
direct access to content rather than via a loca- 
tion code, the potential for nonnumerical infor- 
mation processing is promising. 

DISSEMINATION 

One avenue of communication between peo- 
ple and information is retrieval, i.e., letting 
scientists come to the information and helping 
them locate what they want. Another way of 
augmenting this communication is to seek ways 
in which important information can be chan- 
neled to people who might need it. The em- 
phasis here is on “might'" for there is no way of 
knowing with assurance who will benefit by 
what. 

Most of the details of automatic Selective Dis- 
semination of Information (SDI) have been 
supplied by IBM.^'^ While it is agreed with IBM 
that selective dissemination must be an essen- 
tial part of any technological organization, 
there is no assurance that it will promote nearly 
perfect matching between need and answer to 
the need. 

DIGRESSION ON SCIENTIFIC DISCOVERY 

Richmond, writing on the relation between 
information retrieval and scientific method and 
issuing a plea for more recognition of the 
latter,^2 stresses the role of fortuity in scien- 
tific discovery. She sites several historical ex- 
amples of scientific events and, generalizing, 
describes two such events in which the cards do 
not fall to the researcher's advantage: near 
discovery, where a scientist hovers for years 
on the threshold of a breakthrough, working all 
around it but failing to unlock the solution; 
premature discovery, where a scientist dis- 
closes a major phenomenon either without 
realizing it or in the face of an over-all state of 
the art that is not ready to accept his discovery. 
One reason for the latter is that what may later 
prove to be an immensely important application 
is not at all apparent at the time. Some eminent 
discoveries have at the time of conception been 
dismissed as interesting novelty or at best “vital 
to our understanding of the universe about us 
but of no conceivable utility." 
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Sometimes rediscovery in a sympathetic en- 
vironment carries more weight than the origi- 
nal discovery, which is afterwards forgotten 
except by the probing researcher who might 
have unearthed it from the musty archives of 
‘‘interesting novelties/’ 

What is involved, of course, are those most 
complex and very human patterns of thought, 
deduction and induction. Richmond expresses 
doubt that it will ever be possible to “ . . . con- 
vey deductive and inductive reasoning in suffi- 
ciently abbreviated form . . . or to . . per- 
form retrieval of the kind the historian per- 
forms ...” (i.e., identify a premature discovery 
as a discovery in light of subsequent advances) . 
One is confronted again with the lead that capa- 
bilities, in terms of equipment, have over hu- 
man understanding. 

Present information retrieval and dissemina- 
tion do not do the scientist’s thinking for him 
(and this is Richmond’s prime thesis) ; they 
give him written accounts of work somewhat 
similar to his own. Retrieval and dissemination 
will yield the expected far more often than the 
unusual, the reason being that the systems can- 
not draw meaningful relationships between any 
two heretofore dissociated things. 

Therefore, assuming a retrieval or dissemi- 
nation system operates on key words, if the 
words “aardwolf” and “zymosis” have never 
beep related, the system will not coordinate 
them (except to supply Webster’s Dictionary or 
the Encyclopaedia Britannica) . 

The system may provide separate treatises on 
each, but these will serve only to enhance the 
requester’s separate expertness on both items. 
As an instructive exercise, the system might be 
programmed for random coordination, in which 
case, assuming a typical 5000-keyword system, 
there is one chance in 12,497,500 that the words 
will be associated. Now a slight connection is 
seen between what machines can do and the 
human trial and error process and how infinitely 
more educated the latter. 

There is a need for further study into the 
ways in which scientists benefit by the work 
that has gone before. To accelerate and stimu- 
late the scientist’s learning process, this should 
be the task of a dissemination system. A con- 
crete approach would be the re-creation by 


simulation of some of the significant historical 
breakthroughs and discoveries as well as those 
that did not happen because the information 
was not available at the right time. The objec- 
tive would be to utilize the imposing edge held 
in retrospect today ov^r the researcher in his 
day. The task would involve computer simula- 
tion of the historical event, drawing on both 
antecedent and postcedent (via feedback) liter- 
ature and events. It is hoped that the outcome 
would include (1) a better understanding. of 
the creative thought process and what could be 
done to aid it, (2) a manifold improvement in 
efficiency of the simulated over the real process 
of discovery, e.g., re-creation of an order of 
magnitude sooner than the original occurrence. 
If successful, the profit would be a more efficient 
homing-in on future discovery via the literature 
route. 

The question of meeting user need is taken up 
again briefly under Storage and Retrieval. 

DESCRIPTION OF SELECTIVE DISSEMI- 
NATION OF INFORMATION (SDI) 

There are times when a scientist’s best dis- 
semination system is a good friend. Casual con- 
versation has often produced the vital key in a 
field so remotely diverse as to be otherwise 
overlooked. Such might have been the case 
when the problem of moving film with high pre- 
cision at high acceleration was solved by digital 
positioning techniques developed for an auto- 
matic drilling machine.^^ 

The problem of a formal dissemination sys- 
tem is to be as good a friend in a continuous- 
operation and systematic mode. A formal dis- 
semination system should always be aware of 
the scientist or engineer’s activity, keeping 
pace with it and alerting him to anything by 
which he may profit yet without wasting his 
time with documents of no interest. This is a 
formidable assignment. 

The usual approach of a formal dissemina- 
tion system is to screen potentially relevant ma- 
terials through the standard gradations of 
presentation, i.e., key word, abstract, and docu- 
ment. The key word operation is handled be- 
fore the relevant material reaches the scientist 
by means of a so-called interest profile. The 
interest profile concept is a useful one for match- 
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Figure 3. Flowchart of Selective Dissemination. 

ing operations in general. The profile is essen- 
tially a list of key words or data used to de- 
scribe objects to be compared, e.g., a person, his 
attributes, his information needs, his abilities, 
a document, or needs of an organization. 

Figure 3 illustrates the workings of a selec- 
tive dissemination system used to match infor- 
m.ation of incoming documents with user needs. 

FEEDBACK 

Document profiles remain fairly stable, while 
recipient profiles are subject to continual modi- 
fication through feedback. In the SDI system, 
response cards are provided along with ab- 
stracts of documents whose profiles suitably 
overlap the recipient profile. The recipient re- 
turns the response card with an indication that 
he (1) had no interest in the document, (2) 
found the abstract useful and sufficient, (3) 
found the abstract interesting and requests the 
total document. His profile is then updated on 
the basis of his responses. 

The depth of matching, i.e., the extent to 
which the recipient profile and document profile 
overlap, is a controllable parameter. 

EXPANSION OF SDI 

The skill profile now employed by Space and 
Information Systems Division is one example 
of the profile matching concept. The skill file 
can be searched when management wants to 
know who is knowledgeable in a given area. 


The system furnishes the name and location of 
the person whose particular talents most nearly 
correlate with those sought by the organization. 
The same principle can be applied to new proj- 
ects, matching the contemplated project profile 
with the profiles of projects already underway. 

ABSTRACTING 

Webster's definition, modified to fit docu- 
ments, in a certain sense exemplified the avowed 
purpose of the abstract: a text (as concise as 
possible) that comprises or concentrates in 
itself the essential qualities of a document or 
of several documents (generalizing the latter 
to a book of several chapters, state-of-the-art 
survey, blanket review, etc.) . 

A little investigation reveals that within the 
field of documentation there is not a clear oper- 
ational definition of abstracting to complement 
the conceptual definition above, and indeed sev- 
eral researchers are occupied with finding one. 
Some other things that need to be considered 
are the purpose and advisability of abstracting, 
standardization, and the desirability of stand- 
ardizing abstracts. These factors must be un- 
derstood before delving too deeply into the ulti- 
mate aspect of abstracting mechanization. 

PURPOSE OF ABSTRACTING — 
ABSTRACTING VERSUS INDEXING 

For the purpose of definition, it is impor- 
tant to differentiate functionally between an 
abstract and an index, a detailed descriptor 
index in particular. One can almost visualize 
a document being screened through several gra- 
dations of processing. At one end would be the 
single subject heading or ‘‘catchword" (Schlag- 
wort). At the other end of the range is the 
total document itself. In order of increasing 
depth of representation, the intermediate steps 
would be a bibliographic index, a complete doc- 
ument profile as used in selective dissemination, 
a more detailed informative index using de- 
scriptors or Uniterms, scope notes, a detailed 
index with roles and links, a so-called supplied 
title (a title written by the document proc- 
essor), a one- or two-sentence synopsis, a tele- 
graphic abstract, and a detailed summary or 
abstract. However, the user will not want to be 
bothered with more than three of these includ- 
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ing the total document. Under the proper cir- 
cumstances, the continuum seems to fall spon- 
taneously into the parts necessary for that set 
of circumstances. For example, a trade journal, 
well attuned to the needs of its readers, may 
give a list of one-sentence descriptions of worth- 
while reading matter (supplied titles) provid- 
ing the reader with sufficient stimulus to pro- 
cure the document. However, if a library is to 
get information to a large and diverse clientele, 
it cannot hope to cater to the individual user as 
the editors of a trade journal can to their read- 
ers. It cannot expect to handle an influx of up 
to a thousand documents per day, evaluate their 
content, and dramatize their highlights in the 
way that a trade journal can. Not knowing the 
specific needs of the user, the alternative ex- 
treme would be to give him every potentially 
pertinent document. Such a system would en- 
tail a disproportionate waste of time. 

The index and search techniques will help the 
user to find things which might be what he 
thinks he wants. The dissemination operation 
will help him receive things which might be 
what he had not thought of or did not know 
about. To conserve the user's time, abstracts 
of those documents with possible or latent in- 
terest are supplied to him. Therefore, an ab- 
stract is not an index, for an index indicates 
what a document is about, or it tells where the 
information is located. An abstract is not a 
document in the strict sense of the word for it 
does not tabulate data nor does it give detailed 
hookup and assembly instructions for an instru- 
mentation complex. An abstract is not identical 
with either an index or a document, but it in- 
corporates features of both. 

STANDARDIZATION OF ABSTRACTING 

Several studies are being conducted to find 
operational definitions of abstracting. The two 
mainstreams of effort are directed toward ex- 
amining the effect of various abstracting phi- 
losophies on requesters and their satisfaction 
and toward evaluation of different abstracts of 
the same or similar documents. Several param- 
eters are involved, and in most studies one or 
more of these are controlled. In one investiga- 
tion,^ the size of the natural language abstracts 
is held to a fixed percentage (10 per cent) and 
the work of different abstracters is compared. 


A unique part of the same investigation is 
devoted to the study of “term diagrams," which 
incorporate the “road map" idea of Loren Doyle 
(see Indexing) to show quantitative and con- 
ceptual associations between key words. These 
studies are aimed at eventual automation of 
document condensation. 

The American Institute for Research is con- 
ducting an extensive inquiry into the opera- 
tional definition of abstracting, the development 
of rules guided by task performance on the part 
of both abstracter and user, and the evaluation 
of efficiency.^ A survey is being made among 
abstracting services, publishers, and users to 
gather ideas from those experienced in the field. 
Simulated job conditions are set up to measure 
user comprehension and proficiency under these 
conditions, given both full and abstracted texts. 

AUTOMATIC ABSTRACTING 

The principal efforts in automatic abstracting 
are being carried out at Thompson-Ramo Wool- 
dridge, Inc. System Development Corp., Plan- 
ning Research Corp., and at International 
Business Machines. Without going into the de- 
tailed procedures for abstracting, the primitive 
approach is to first find statistically significant 
keywords (content words with a high frequency 
of o^icurrence) and then find those sentences 
which contain the greatest number of such key- 
words. The results so far are conspicuous by 
their lack of coherence ; partly because of habit 
patterns in reading, they seem to present dis- 
connected ideas with a loss of flow between the 
ideas. Of course, the abstracts suggest rather 
than condense or represent the document con- 
tents. Nevertheless, they are surprisingly good 
on occasion. The problems just mentioned have 
been instrumental in the inception of more prob- 
ing studies into the nature of human abstract- 
ing. 

CRITICAL COMMENT ON 
AUTOMATIC ABSTRACTING 

As an incidental preface, the importance of 
the abstract in general should be considered. 
It would be difficult to overemphasize the place 
of the abstract or document summary in the 
mechanized system of the future. Apart from 
its utility to the user in some form, it will be the 
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prime medium of the system, even the main unit 
of communication. An abstract is a useful and 
logical unit because it can be printed on a frame 
of microfilm or on a coded (punched or mag- 
netic) card or on a single sheet of paper. 

Even advanced mechanized systems will 
probably use manually produced abstracts. 
Automation of the abstracting process will be 
one of the last stages in the evolution of fully 
automated systems. The reason is evident after 
a moment's reflection. Difficult as the problem 
of automatic language translation is, it is one 
order of complexity lower than the job of ab- 
stracting. In many ways the difference between 
the two is analogous to the disparity between 
a dictionary translation and a translation 
turned out by a truly bilingual specialist in 
the field. The latter, if a good translator, will 
not simply convert words or even phrases. He 
will restate the author's thoughts in the new 
medium. The abstract must do this same thing. 

Automatic abstracting comprises the ulti- 
mate refinement of automatic translation and 
automatic indexing. It must translate ideas 
from expanded to compressed language, and it 
must be selective. It must also be endowed with 
a product all its own, the attribute of para- 
phrase. 


REPRODUCTION AND DISPLAY 


There are a number of hard copy reproduc- 
tion methods available, including : 


Copy Camera 
Ditto Process 
Electrography 
Letter Press 
Lithography 
Mimeograph 
Offset Printing 


Ozalid 

Photoengraving 

Photogravure 

Photolithography 

Rotogravure 

Thermofax 


There is a method to suit almost every con- 
ceivable need. In general, the obtaining of 
higher-quality, cleaner copy calls for a greater 
investment. 


The situation is not so well resolved in the 
case of display systems. The technology of dis- 
play has generally lagged behind the advance- 
ment of data processing and transmission. 
However, government and industry have begun 
to exhibit a resurging interest in computer-gen- 
erated large-scale displays. The development of 


high-capacity command and control systems 
within the Air Force has emphasized the need 
for sophisthicated data-presentation subsys- 
tems. 

There are three main areas of technological 
effort : 

Projection, using a stable light modulator, 
such as film or selenium plate. 

Light valve, also a projection device, with 
direct electronic control of the image. This 
has advantages of speed and elimination of 
expensive film and the disadvantages of 
resolution and brightness (two urgently req- 
uisite factors in IS&R display). 
Electroluminescence, no projection, experi- 
mental prototypes only. Display surface acts 
both as light source and modulator. 

In general, for the specific applications of 
IS&R, there are no suitably inexpensive sys- 
tems for reproduction and display. Essentially 
what is needed is a low-cost, low-volume device 
to meet the demands of a circulating library 
replaced by a noncirculating, automatic 
transmission-display-reproduction system. Ad- 
mittedly, for many applications, microfilm dis- 
play can be made inexpensive, particularly 
where it is sufficient to exhibit an entire se- 
quence of frames or film. But for IS&R, which 
is a selective application, research and produc- 
tion have not come up with adequate equipment. 

COMMUNICATION LINKS 

Hardware development in IS&R will ulti- 
mately make it unnecessary that a requester be 
physically present at the information center. 
Links for communication between remote re- 
quest stations and the center, which are now 
available or in advanced stages of development, 
range from conventional telephone lines to sys- 
tems employing microwave devices. A closed- 
circuit television system (CCTV) will lend it- 
self very readily to application in IS&R. There- 
fore, the communication system might be as 
simple as one permitting a requester to initiate 
a search by telephone and whose results, in the 
form of abstracts, are returned to him by tele- 
phone or mail. With abstracts and documents 
stored on microfilm and a CCTV/telephone sys- 
tem, the requester could, in a more advanced 
system, view abstracts or the document itself. 
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The ultimate in a communication link be- 
tween the user and the information center 
would permit the user to deal directly with an 
automatic IS&R unit (computer) to initiate a 
search, call up on a viewing screen the docu- 
ments resulting from the search, and reproduce 
these in the form of hard copy. While all the 
functions involved are conceptually simple, con- 
siderable development and system integration 
effort must be accomplished before the ultimate 
system becomes a reality, especially in the de- 
velopment of CRT line scanning (800 to 1200 
lines). 

In the area of volume hard-copy reproduc- 
tion, to take one example, the state-of-the-art 
survey for this report revealed no systems that 
could be called ideally suited to IS&R if one 
considered high-cost trade-offs for systems such 
as electrostatic-video printing. Among the 
lesser problems, considerable work in human 
factors for such a system might be done. De- 
tailed answers to questions of viewing-screen 
brightness and contrast and ambient illumina- 



Figure 4. Equipment State of the Art. 


tion should be made available. While all the 
individual functions involved are conceptually 
simple, considerable development and system- 
integration effort must be accomplished before 
this becomes a reality. Much groundwork has 
been done,^^ Uut the extent to which experience 
in viewing conventional TV visual presenta- 
tions can be applied to the prolonged reading of 
text for an ideal IS&R visual link is ques- 
tionable. 

SUMMARY 

The status of information storage and re- 
trieval is summarized in Figures 4 (Equip- 
ment) and 5 (Techniques). The sizes of the 
boxes are drawn to indicate their relative 
“strength’’ of development of the given area. 
Note that the “Equipment” circle is drawn 
larger than the “Document Processing” circle 
to show the greater strength of development in 
this area. A notable exception is the equip- 
ment for graphics and display, microfilm read- 
ing, and reproduction in particular. 



Figure 5. Technique State of the Art. 
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In two respects, IS&R is not yet an in- 
tegrated science in the sense of chemistry or 
even such an interdiscipline as bionics. First 
of all, the contributing disciplines are diverse 
and overlapping, but hardly unified. Secondly, 
although many nonconventional systems are in 
operation throughout the world, they have little 
in common by way of a unifying description. 

The most glaring deficits are the software 
lag and lack of a truly empirical basis. Today 
is part of an era of machines that can analyze 
data, read and recognize printed symbols, recog- 
nize audible and verbal signals, be conditioned 
and learn, teach humans or other machines, and 
even make decisions. Thin film memories are 
reducing access times to the nanosecond realm. 
A billion items of distinct information can be 
stored in a space the size of a paperback novel. 
But trailing far behind these amazing creations 
in the equipment field are the concepts of how 
people communicate with each other, partic- 
ularly when the scientist or engineer presenting 
his results or theory on paper does not know 
who in what remote corner will want to read 
it, or when the person seeking information has 
some vague unfulfilled desire and does not 
know who in what remote corner has the happy 
cure for his troubles. Each one hopefully relies 
on those who are entrusted with getting the 
answer to the need and finding the market for 
the goods. 

In the meantime, advanced thinkers are 
passing over the concepts-equipment gap and 
are developing theories for conceptual machines 
not yet in existence. And — emphatically — they 
are needed. But concurrent with these efforts 
is a greater need for less speculation and con- 
jecture, less projection of pet ideas and sys- 
tems, and more experimental investigation and 
scientific method. 

The ultimate criterion of how well a retrieval 
system works is the extent to which retrieved 
information meets the need of the user. From 
the outset one is confronted with problems be- 
cause user need is already an idealization. The 
corresponding measure of how well this need is 
met might be called user satisfaction, which is 
also an idealization. Because user need and 
user satisfaction are highly unquantitative and 
i^i^^hly indeterminate, they must be approxi- 
mated. The expression of user need is a request 


and the vague notion of user satisfaction is 
simulated by the notion of relevancy. The situa- 
tion might be visualized as in Figure 6. 




Figure 6. Contrast Between Ideal and Real Informa- 
tion Retrieval. 
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TRAINING A COMPUTER TO ASSIGN DESCRIPTORS 

TO DOCUMENTS: 

EXPERIMENTS IN AUTOMATIC INDEXING 


M. E. Stevens and G. H. Urban 
National Bureau of Standards 
Washington, D. C. 


INTRODUCTION 

During the past five to ten years, increasing 
interest has developed in the use of machines 
as a substitution for human intellectual effort 
in the indexing or classification of the subject 
content of documents. In terms of practical 
applications, the greatest interest to date has 
been in keyword indexing from the significant 
words actually occurring in titles, abstracts, 
or full texts. Pioneering use of computers for 
this purpose, by Luhn* and Baxendale,^ has 
been followed by the development of a number 
of KWIC (keyword-in-context) and similar 
programs. 

This is “derived” or “derivative” indexing as 
opposed to “assignment” indexing.®-® In de- 
rivative indexing, the indexing terms used are 
identical with words chosen by the author him- 
self. In assignment indexing, the terms used 
are typically drawn from a classification sched- 
ule, a subject heading list, or a thesaurus of 
descriptors, and they are assigned on the basis 
of decision- judgments about the appropriate- 
ness of these terms as indicative of the subject 
matter of the document in question. Approaches 
to indexing by computer based upon assignment 
of indexing terms or upon automatic classifica- 
tion techniques have been reported by Baker,i 
Borko,®-^ Maron,®-!® Needham, and Swan- 
son,i® among others. 

Small-scale experiments in automatic assign- 
ment indexing have also been conducted at the 


National Bureau of Standards in recent months. 
The SADSACT (Self-Assigned Descriptors 
from Self And Cited Titles) method is based on 
the two working hypotheses : 

1. That, given only a “teaching sample” of 
previous human indexing of representa- 
tive items, the machine can derive statis- 
tics of association between substantive 
words used in the titles or abstracts and 
the descriptors assigned to these items 
such that patterns of substantive words 
in the titles and cited titles of new items 
can be used for automatic assignment of 
descriptors to these new items, and 

2. That, as in the case of citation indexes, 
the bibliographic references cited by an 
author can provide useful clues to the 
subject of his own paper. 

In addition, we hope to demonstrate an auto- 
matic indexing technique that will minimize 
time and cost of preparation by limiting the 
amount of input material required and by 
capitalizing on byproduct data generation from 
one-time keystroking of descriptive cataloging 
information plus the text of the titles of biblio- 
graphic references cited in new items. 

THE TRAINING SEQUENCE AND INDEX- 
ING PROCEDURE 

In training the computer to assign descrip- 
tors to our documents, we first select a sample, 
presumably representative, of items already 


563 
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in our collection. These have all previously been 
indexed to some pre-established indexing-term 
vocabulary, as the result of human subject 
content analysis. In our experiments to date, 
the “teaching sample” has consisted of ap- 
proximately 100 items drawn from a collec- 
tion of about 10,000 technical reports in the 
fields of information retrieval and potentially 
related research including computer design, 
programming languages, mathematical logic, 
linguistics, pattern recognition, psychology, 
operations research, and the like. These teach- 
ing sample items are ones that had all pre- 
viously been indexed by analysts at ASTIA 
(now the Defense Documentation Center) in 
the year 1960. Descriptive cataloging informa- 
tion, abstracts, and lists of descriptors assigned 
were available. This information was key- 
stroked on Flexowriters and the texts of the 
titles, descriptors, and abstracts were con- 
verted to punched cards for input to the com- 
puter. 

In programming the SADSACT method for 
trial on the IBM 7090-7094 computers, the 
cards for each of the documents in the teaching 
sample are read into the computer, one item at 
a time. A list of descriptors assigned to these 
items is formed, and the frequency of use of 
each descriptor for the sample as a whole is 
tallied. The text of the title and abstract of 
each document is then processed against a 
“stop” list to eliminate the common and non- 
significant words. Each remaining word from 
document title and abstract is associated with 
each of the descriptors previously assigned to 
that document. Sorted lists of English words 
and their descriptor co-occurrence relations are 
formed and read out, using three output tapes. 

When all the documents in a teaching sample 
have been processed, the next step is to identify 
the “validated descriptors” — ^that is, those that 
have occurred three or niore times in a 100-item 
sample.* (The “names” of other descriptors 
assigned to not more than one or two items are 


* In our first teaching sample, 219 different descrip- 
tors had been used by the DDC indexers, and 72 
occurred for three or more items in the sample. Our 
second teaching sample, formed by substituting differ- 
ent items for 30 items withdrawn from the first to be 
used as test material, gave 221 descriptors, of which 70 
were validated. 


retained as ‘'candidate descriptors’’ but word 
co-occurrence data is not retained for these. f) 
For these validated descriptors, the word-de- 
scriptor association lists are then merged into 
a master vocabulary list which gives for each 
word the identity of the descriptors with which 
it co-occurred and the relative frequency of its 
co-occurrence with each descriptor. 

The SADSACT automatic indexing method, 
therefore, uses an ad hoc statistical association 
technique in which each word may be associated 
either appropriately or inappropriately with a 
number of different descriptors. In the sub- 
sequent indexing procedure, reliance is placed 
on patterns of word co-occurrences and on re- 
dundancy in new items to depress the effects 
of non-significant or inappropriate word-to- 
descriptor associations and to enhance the 
significant ones. 

The SADSACT indexing procedure is carried 
out as follows. The text of the title of a new 
item and of titles cited as bibliographic refer- 
ences by the author is keystroked, and the by- 
product punched paper tape is converted to 
cards for input to the computer. This input 
material is processed against the master vocab- 
ulary list to yield, for each word matching a 
word in the vocabulary, a “descriptor-selection- 
score” value for each of the descriptors pre- 
viously associated with that word. When all 
words from titles and cited titles have been 
processed, the descriptor scores are summed 
and for some appropriate cutting level,t those 
descriptors having the highest scores are as- 
signed to the new item. 

The actual score value derived reflects both a 
normalizing factor (based, for example, on the 
ratio of the number of previous co-occurrences 
of this word with a particular descriptor to the 


fOne of the distinctions to be emphasized in assign- 
ment as opposed to derivative indexing is that although 
a word or phrase occurring in a title or in text may 
coincide orthographically with the “name"^ of a de- 
scriptor or subject heading, its occurrence may or may 
not result in the assignment of that descriptor to the 
item in which it occurred. 

}In machine tests to date, the cutting level has been 
arbitrarily set so that exactly twelve descriptors are 
assigned to each item. However, since these are printed 
out in decreasing order of selection-score values, results 
of other cutting levels can readily be computed. 
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number of different words co-occurring with 
that descriptor) and a weighting formula that 
gives greater emphasis to words occurring in 
‘'self ’’-title (the author’s own choice of termi- 
nology) than to those occurring in cited titles. 
Similarly, greater emphasis is given to words 
that coincide with the names of descriptors. 
The latter weighting enables the assignment of 
“candidate” as well as “validated” descriptors, 
if words so coinciding occu^ several times in 
the ir^put material. An extension of this feature 
would allow similar treatment to be given to 
n-tuples of substantive words (such as “auto- 
matic character recognition”) that had occurred 
with some frequency in the teaching sample 
material, so that “potential descriptors” could 
be used to provide change and growth in the 
indexing vocabulary. 

RESULTS TO DATE 

In a previous informal report, we have noted 
the results of applying the SADSACT method 


to eight new (1963) items to our first 100-item 
teaching sample, consisting of selected docu- 
ments indexed by DDC in the spring of 1960. 
These results were disappointing in terms of 
comparison against discriptors assigned by 
DDC indexers, largely because of changes in 
indexing philosophy and vocabulary such that 
many descriptors used by human indexers in 
1963 were not available to the machine. How- 
ever, 49% of the descriptors judged by a sub- 
ject matter specialist to be relevant to these 
items were correctly assigned by the computer 
program. 

Our first report also covered tests on 30 
“new’' items drawn from the teaching sample 
itself in order to provide, quickly, a basis for 
evaluation in which the indexing vocabulary 
could be held constant. Where titles and ab- 
stracts had been used for the teaching sample 
procedure, for this test the titles and cited 
titles only were used, so that a reasonable 
proportion of the input was new rather than 


TABLE 1. Machine “hits” against DDC assignments, 
tests run against T.S. #1 only 

% Hits 
Validated 
Descriptors 


Short Title of Item 


% Hits 
All DDC 
Descriptors 


% Hits 
Descriptors 
Available 
to Machine 


Perceptrons and the theory of brain mechanism 

37.5 

42.9 

60.0 

'(Pattern recognition with an adaptive network 

33.3 

33.3 

50.0) 

Pattern recognition with an adaptive network 

50.0 

50.0 

50.0 

Invariant input for a pattern recognition machine 

25.0 

33.3 

33.3 

An adaptive character reader 

37.5 

60.0 

rrc /\ 

/o.u 

Theory of files 

25.0 

50.0 

50.0 

Learning in pattern recognition 

33.3 

50.0 

50.0 

Reorganization by adaptive automation 

50.0 

66.7 

66.7 

Finite state languages formal representations 

20.0 

20.0 

20.0 

Some computer experiments with predictive 

45.5 

55.6 

71.4 

coding 




On categorical and phrase structure grammars 

50.0 

66.7 

66.7 

Error correcting codes from linear sequential 

33.3 

42.9 

75.0 

Method of improving organization in large files 

57.1 

66.7 

100.0 

Machine program for theorem proving 

60.0 

75.0 

100.0 

Pattern target analysis 

0.0 

0.0 

0.0 

Tables of Q functions for two Perceptron models 

25.0 

33.3 

40.0 

Lineal inclination in encoding information 

22.2 

25.0 

40.0 

TOTAL 

549.2 

713.1 

858.1 

AVERAGE 

34.3 

44.6 

53.6 
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TABLE 2. Machine “hits’’ against DDC assignments, 
tests run against both T.S. #1 and T.S. #2 



T.S. #1 



T.S. #2 

% 

% 

9 '° 

% 

% % 

All 

Available 

Validated 

All 

Available Validated 


Flowgraphs for nonlinear sys- 

14.3 

20.0 

33.3 

14.3 

20.0 

33.3 

tems 







Activity in nets of neuron-like 

9.1 

16.7 

33.3 

18.2 

33.3 

66.7 

elements 

Survey of cueing methods in 

42.9 

50.0 

60.0 

42.9 

50.0 

60.0 

education 

Facet design and analysis of 

37.5 

42.9 

50.0 

12.5 

14.3 

16.7 

data 

Automatic recognition of speech 

25.0 

28.6 

66.7 

25.0 

28.6 

66.7 

Problems in nonlinear control 

33.3 

50.0 

50.0 

16.7 

25.0 

25.0 

systems 

Guiding principles for time and 

40.0 

100.0 

100.0 

40.0 

100.0 

100.0 

cost 

Approach to pattern recognition 

44.4 

50.0 

57.1 

33.3 

37.5 

42.9 

using linear threshold devices 
Simultaneous multiprogram- 

28.6 

40.0 

66.7 

21.4 

30.0 

50.0 

ming of electronic computers 
Design and study of correlation 

22.2 

44.4 

80.0 

22.2 

44.4 

80.0 

instrumentation 

The learning model approach 

27.3 

30.0 

33.3 

27.3 

30.0 

33.3 

Soviet computer technology 

50.0 

50.0 

60.0 

50.0 

50.0 

60.0 

Computer applications of Boo- 

37.5 

75.0 

75.0 

25.0 

50.0 

50.0 

lean functions 

Sequential encoding and decod- 

36.4 

36.4 

50.0 

27.4 

27.4 

37.5 

ing 

Design of a language for Sim- 

75.0 

75.0 

75.0 

75.0 

75.0 

75.0 

plex system users 

Automated teaching a review 

55.6 

55.6 

71.4 

55.6 

55.6 

71.4 

TOTAL 

579.1 

764.6 

981.8 

506.8 

671.1 

888.5 

AVERAGE 

36.2 

47.8 

61.4 

31.7 

41.9 

55.6 


“source” material. For these 30 items, the 
average “hit” accuracy was 64.8%. This in- 
cluded the occasional assignment of candidate 
descriptors in cases where words in title or cited 
titles coincided with the names of candidate 
descriptors frequently enough to build up a 
score above the cutting level. (For example, 
the candidate descriptor “Sleep” was assigned 
to an item entitled “Human performance as a 
function of the work-rest cycle.”) Considering 
validated descriptors alone, the machine as- 
signment accuracy was 77.4%. 


These results were suspect, however, because 
of the probable source material bias. Accord- 
ingly, substitutes were prepared for each of 
these first “test” items to form a second teach- 
ing sample consisting of 70 items from teach- 
ing sample #1 and 33 additional items. Items 
from teaching sample #1, now genuinely 
“new,” and some additional items were then 
run against teaching sample #2. Items new to 
both samples were also run against sample #1. 
Results for 101 machine tests on 59 different 
items are shown in Tables 1, 2, and 3 in terms 
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TABLE 3. Machine “hits” against DDC assignments, tests run against 
T.S. #2, both with cited titles and with abstracts 


Cited Titles Abstracts 



% 

All 

% 

Available 

% 

Validated 

% 

All 

% % 
Available Validated 

Construction of convolution 

66.7 

66.7 

66.7 

66.7 

66.7 

66.7 

codes by sub-optimization 

Iterative circuit computers 

40.0 

40.0 

40.0 

40.0 

40.0 

40.0 

Use of context cues in teaching 

20.0 

25.0 

28.6 

20.0 

25.0 

28.6 

machines 

Model for communication with 

42.9 

42.9 

50.0 

28.6 

28.6 

33.3 

learning 

Word length and intelligibility 

28.6 

33.3 

40.0 

14.3 

16.7 

25.0 

Machine correction of garbled 

71.4 

71.4 

71.4 

85.7 

85.7 

85.7 

English text 

Multiple task study on automatic 

50.0 

50.0 

50.0 

75.0 

75.0 

75.0 

data processing 

Independence of attributes in memory 

50.0 

66.7 

66.7 

25.0 

33.3 

33.3 

Use of models in experimental 

42.9 

50.0 

60.0 

28.6 

33.3 

40.0 

psychology 

Linear recurrent binary error 

42.9 

42.9 

50.0 

57.1 

57.1 

66.7 

correcting codes 

Adaptive sample data systems 

42.9 

50.0 

75.0 

28.6 

33.3 

50.0 

Common programming language task 

80.0 

80.0 

80.0 

80.0 

80.0 

80.0 

Some requirements for produc- 

14.3 

50.0 

100.0 

14.3 

50.0 

100.0 

tion control data 

Remembering the present state 

25.0 

25.0 

33.3 

25.0 

25.0 

33.3 

of a number of variables 

Papers on the APT language 

57.1 

80.0 

80.0 

57.1 

80.0 

80.0 

Topics in the theory of discrete 

100.0 

100.0 

100.0 

100.0 

100.0 

100.0 

information channels 

Some aspects of problem solving 

28.6 

28.6 

33.3 

28.6 

28.6 

33.3 

Predictive model for self-organ- 

25.0 

25.0 

33.3 

50.0 

50.0 

66.7 

izing systems 

Adaptive switching circuits 

29.9 

38.5 

55.6 

29.9 

38.5 

55.6 

Techniques of pictorial data reduction 

71.4 

83.3 

83.3 

57.1 

66.7 

66.7 

Combining of classes condition 

37.5 

50.0 

60.0 

50.0 

66.7 

80.0 

in learning 

On some relations between 

57.1 

57.1 

66.7 

42.9 

42.9 

50.0 

human engineering 

Human performance as a func- 

9.1 

14.3 

25.0 

18.2 

28.6 

50.0 

tion of the work-rest cycle 

The simulation of human thought 

85.7 

85.7 

85.7 

71.4 

71.4 

71.4 

Studies of communication channels 

14.3 

16.7 

50.0 

14.3 

16.7 

50.0 

Concept of the ideal observer in 

44.4 

44.4 

50.0 

33.3 

33.3 

37.5 

psychophysics 

Learning in random nets 

— 

— 

— 

100.0 

100.0 

100.0 

TOTAL 

1177.7 

1312.5 

1544.6 

1241.6 

1368.1 

1609.2 

AVERAGE 

45.3 

50.5 

59.4 

45.9 

50.7 

59.6 
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of the percentages of descriptors originally as- 
signed by DDC indexers which were also as- 
signed by machine. These results show, even 
more than had been expected, a significant drop 
below the ‘‘hit’' accuracy obtained when the 
same items were run under partially “source 
material” conditions. § 

The overall average “hit” accuracy for these 
tests is only 40.1%, considering all descriptors 
assigned to these items by DDC. However, in 
spite of the use of test items drawn from the 
same time period as the teaching samples in 
order to maximize consistency of indexing 
vocabulary, 19.1% of the descriptors assigned 
by DDC were not available to the machine. 
When corrected for this not-available factor, 
the average “hit” accuracy was 48.27^. Other 
descriptors used by DDC for these items were 
available to the machine only as candidate de- 
scriptors and could only be assigned if input 
words for the test items coincided with the 
names of these descriptors. Considering only 
the descriptors that were freely available to 
machine, the validated descriptors, the com- 
puter assignment accuracy was 58.1%. In only 
one test case, for these 59 items, were no hits 
achieved by the machine method (“Pattern 
target analysis”). 

Figures 1 and 2 represent, respectively, ma- 
chine assignments for title-and-cited-titles and 
for title-and-abstract of an item subsequently 
evaluated by several members of our staff, as 
shown by initials. Checks to the left of the de- 
scriptors indicate agreements with DDC as- 
signments. The machine missed three DDC as- 
signed descriptors for this item: “Electronic 
Circuits,” “Machine Tools,” and “Servomech- 
anisms,” the latter two not being available to 
the machine system. 

Figures 1-2 illustrate the present format for 
computer printout, with assignments in order 
of decreasing selection scores (numeric data 
to the right of descriptor names) for three dif- 
ferent weighting formulas. These figures also 
illustrate differences in order of machine selec- 
tion and of actual descriptors chosen by the 

§This is apparently due to the fact that although 
only each item’s own title was source material, the 
weighting formulas to emphasize significance of words 
in the self -titles changed the effective proportion of the 
new material in the cited titles. 
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SADSACT technique using cited titles as 
against abstracts as input. 

In our earlier tests, the same items had been 
run both with cited titles and with abstracts 
against the teaching sample in which they had 
been included (that is, as both partially and 
entirely source material items). For these 
items, the 64.87^ hit accuracy for the cited 
title case improved to 75.0 7c average accuracy 
in the abstracts case. However, when the new 
test items were run both ways against teaching 
sample #2, there was no significant difference 
in the average accuracies obtained as between 
using titles-and-cited-titles only and using 
titles-and-abstracts from the same items. 

Problem of Evaluatioyi 

The problems of evaluation of any indexing 
method whatsoever, whether carried out by 
man or by machine (or by men aided by ma- 
chines) are crucial. O’Connor has aptly 
pointed out that: . . the question, Ts mech- 

anized indexing possible?’ by itself is not 
answerable, because it is not intelligible. More 
positively expressed, if you want to ask clearly 
about the possibility of mechanized indexing, 
you should have both some notion of ‘good re- 
trieval,’ and some notion of how good you want 
the retrieval to be which is permitted by mech- 
anized indexing.” 


In advance of evaluations based on retrieval 
tests, we have relied, first, on comparisons with 
the prior human indexing. We have been faced 
with the mechanics of assuring consistency-of- 
indexing criteria as necessary for any realistic 
assessment of machine as compared to manual 
indexing of the same items. As we have noted, 
this is not possible even for “training” and 
“test” items which are all documents of in- 
terest in our collection and which were all in- 
dexed in approximately the same time period 
by the same DDC (ASTI A) indexer-staff. As 
between “teaching sample #1/' “teaching 
sample #2” (of 100 and 103 items, respec- 
tively, with 70% overlap), and of test items 
“new” to both these samples there are obvious 
differences as to the descriptor-vocabulary and 
indexing philosophy. This type of discrepancy 
was even more marked in the first run of eight 
(1963) items against the (1960) basis of teach- 
ing sample #1. 

More serious, however, are the questions of 
the O’Connor remarks previously cited: those 
of evaluation of any indexing method whatso- 
ever. We have, in our test data, a provocative 
case in point. In Table 4 we show the m_achine- 
assigned descriptors for the paper by L. G. 
Roberts of M.I.T. entitled “Pattern recognition 
with an adaptive network.” Although exactly 
12 descriptors were required, by our present 


TABLE 4. Descriptors assigned by computer to 
“Pattern Recognition with an Adaptive Network” 

Data 

Sampling 

Learning 

Simulation 

Computers 

Data Processing Systems 
Design 

Digital Computers 
Mathematical Prediction 
Mathematical Logic 
Digital Systems 
Mathematical (.-omputer Data 
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program, to be assigned, none of the 12 so 
assigned by machine is entirely irrelevant or in- 
appropriate. As it happens, this same document 
was indexed twice by DDC personnel, first as 
an unpublished report, later as a reprint of the 
version appearing in the open literature. In- 
dexer A assigned to this item the four descrip- 
tors: “Coding,"’ “Digital Computers,” “Mathe- 
matical Computer Data,” and “Programming.” 
Indexer B assigned : “Digital Computers,” 
“Electrical Networks,” “Identification Sys- 
tems,” “Programming,” “Simulation,” and 
“Test Methods.” 

It is of interest, first, that although our ma- 
chine method “hit” score was 50% with respect 
to Indexer A, and only 33.3% with respect to 
Indexer B, at the same time A’s agreement with 
B was exactly 33.3% and B’s with A was 507^* 
More significantly, however, the machine 
method was the only one of the three which 
assigned the crucial descriptor “Learning” (re- 
fiecting the self-organizing and adaptive fea- 
tures of the mechanism described) to the item. 
Indexer B assigned two ^appropriate descrip- 
tors — “Test Methods” and “Electrical Net- 
works,” but the SADSACT technique did not 
make either mistake, even though the word 
“network” did appear in the item’s own title. 

A second problem of evaluation we are faced 
with involves the consequences of an assump- 
tion that the previously-given hurpan indexing 
decisions, for a sampling of our collection, are 
in fact “good enough.” This is a misleading as- 
sumption. As a case in point, of the approxi- 
mately 200 items handled either in the teaching 
samples or as “new” items, only one could be 
appropriately indexed under the descriptor 
“Computers” that also involved analog com- 


these items, if “Computers” was a relevant 
puters or systems. Thus, for all but one of 
descriptor, so also was the descriptor “Digital 
Computers.” There is considerable debate about 
claims for “consistency” of machine, as opposed 
to manual, methods of indexing. In some of 
our tests, for example, we assigned both the 
descriptor “Computer” and the descriptor 
“Digital Computers” in 41.2% of the cases in 
which either was assigned, but the human in- 
dexers assigned both if either was used only 
33.3% of the time. 

In general, very little data are available on 
interindexer consistency or on the likelihood 
that the same indexer would index the same 
item the same way on different occasions. 
Painter reports figures for interindexer con- 
sistency as ranging from 54% similarity of 
indexing to 82%. Her data for the case of 
DDC show that: “One hundred and eleven 
starred descriptors were used the first time 
which did not appear the second (41%) and 98 
starred descriptors were used the second time 
and not the first (36% ).” Jacoby and Slamecka'^ 
report even less favorable figures, with inter- 
indexer consistency as low as 18% in some 
cases. In the light of such indications, the auto- 
matic assignment indexing techniques appear 
promising. In addition, however, we have initi- 
ated the investigation of relevance judgments 
on the part of subject matter specialists review- 
ing the full texts of our test items. 

Evaluation by Relevance Judgments 

The data shown in Tables 1-3 compare ma- 
chine assignments with those previously made 
by human indexers. An alternative method of 
evaluating the test results is to determine the 


TABLE 5. Average per-cent agreement, one or more 
people with machine assignments 


No. of Indexers 

No. of Items 

12 

No. of Descriptors Assigned 

9 6 3 

1 

2 

1 

50.0 

66.7 

66.7 

100.0 

100.0 

3 

4 

47.9 

52.7 

58.3 

66.7 

75.0 

4 

10 

40.0 

45.7 

55.6 

53.3 

60.0 

5 

10 

54.2 

58.9 

68.3 

80.0 

90.0 


OVERALL 

47.4 

52.9 

61.6 

67.9 

76.0 
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probable relevancy of those descriptors that are 
assigned by the machine. Twenty-five of the 
items in the test runs have therefore been sub- 
mitted to one or more members of our staff, all 
of whom are users of the collection. They were 
asked to choose 12 descriptors for each item ex- 
clusively from the list of descriptors actually 
available to the machine. The results of these 
evaluations are shown in Table 5, which gives 
the summary data for judgments of relevance 
by one or more persons (including the initial 
DDC indexer) of descriptors assigned by the 
SADSACT technique. It should be noted that 
these judgments are based upon the independ- 
ent assignment by the human analysts of these 
same descriptors to the same item, rather than 
by review of the machine assignments. Thus 
the results are probably conservative. 

The results in Table 5 also show thfe percent- 
ages of machine-assigned descriptors judged 
by human analysts to be relevant for cases 
where the machine assigns fewer than 12 de- 
scriptors. It is evident that the fewer descrip- 
tors assigned by machine, the better the chance 
that human evaluators will judge those assigned 
to be relevant. It is also evident that the more 
the people who index, the better is the chance 
that one or more will assign a descriptor that 
the SADSACT model also assigns. 

Tables 6 and 7 provide more detailed data 
for the cases where independent evaluations 
were made by several people. Table 6 shows 
machine-indfexer and interindexer agreement 
for the 10 cases where four of our own ‘‘cus- 
tomers'' had analyzed the items. It provides, 
for each of the different descriptors assigned by 
machine at the 6-descriptor cutting level, indi- 
cation of the i em for which these assignments 
occurred, and the agreement with assignments 
of that descriptor. Table 7 gives, for 20 cases 
in which four or more people made independent 
evaluations, the chances that the machine will 
choose descriptors also chosen by people for 
the first descriptor assigned, first two, ... up to 
the total of 12. 

Taking both the “hit" accuracy and the 
human agreement data into account, the SAD- 
SACT results compare favorably with those re- 
ported by Maron^ (51.8%) and by Borko^ in 
an experiment using the same computer litera- 
ture abstracts as had been used by Maron 


(46.5%). Our results to date also appear to 
fall within the range of agreement-data for 
human interindexer consistency. 

Future Consider atious 

Our present plans call, first, for continued 
testing and evaluation of a number of new 
items, varying such parameters as the size of 
the teaching sample, the number of descriptors 
to be assigned, and weights for both prior- 
association occurrences and descriptor selec- 
tion. This additional experimentation can be 
carried out using the present preliminary pro- 
gram. Depending on the results of continued 
testing, we except to explore applicability of 
the method to other subject matter areas (both 
for more homogeneous and more heterogeneous 
collections) if they are favorable, and to study 
means for improvement by various means of re- 
introducing human judgment into the process, 
if the results are discouraging. 

If the promise of the early results is sub- 
stantiated on further runs, we shall also wish to 
re-investigate and refine the programming tech- 
niques. For our preliminary evaluation, many 
ud hoc choices and arbitrary assumptions were 
made. These will require modification. Addi- 
tional programming refinements should provide 
for consolidated vocabulary-descriptor associa- 
tions for larger teaching samples, incorporation 
of occurrences of n-tuples of significant words 
as “potential descriptors," experimentation 
with various weighting formulas, and the like. 

The SADSACT indexing procedure differs 
from most other efforts at realization of as- 
signment-type indexing by machine in two 
principal respects. First is the emphasis on 
a posteriori, completely mechanizable, tech- 
niques that nevertheless are based, in a sam- 
pling sense, upon prior judgments of people 
with respect to specific requirements of real job- 
situations. The second differentiating aspect is 
the insistence not only upon obtaining the 
consensus of the collection as the basis of 
vocabulary compilation and word-descriptor 
associations, but also of going beyond the in- 
dividualistic usage of language and terminology 
by a particular author. This latter emphasis 
tends to reduce the effect of linguistic idosyn- 
cracies and consequent scatter of like items 
under a number of different indexing terms. 





TABLE 6. Agreements, by descriptor and item, of one or more persons 




with first six descriptors assigned by machine to each of 





10 test items 


Agreement 



Number of persons agreeing with machine assignment 


Assignments 





of this 

Descriptor 


5 4 3 2 1 

0 

Descriptor 


CCCS’' 

, LRBE, 



Coding 

CPLT 

TTDI 


100.0 

Theory 

SHT 

CCCS, CPLT TTDI 

WLI, see 

66.7 

Errors 


CCCS, LRBE 


100.0 

Data Transmis- 


CCCS LRBE see 


100.0 

sion Systems 





Electronic 



CCCS, LRBE 

0.0 

Circuits 




Information 

Theory 

TTDI 

CCCS 


100.0 

Language 


WLI, CPLT, 

PAL 


100.0 

Machine 

Translation 


CP LT PAL 

WLI 

66.7 

Data Processing 
Systems 


MTSA ASDS 


66.7 

Intelligibility 


WLI 


100.0 

Bibliography 



WLI, TTDI 

0.0 

Design 


MTSA ASDS 

SHT 

66.7 

Human 

Engineering 

MTSA 

SHT 


100.0 

Automatic 


PAL MTSA 


100.0 

Synthesis 



LRBE 

0.0 


Classification 


LRBE 


0.0 
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Data Storage 
Systems 





ASDS, PAL 
MTSA 

0.0 

Data 


ASDS 




100.0 

Mathematical 

Logic 






0.0 

Communications 

Theory 


see 




60.0 

Programming 

CPLT, PAL 





100.0 

Digital 

Computers 


CPLT 

PAL, SHT 



100.0 

Probability 



TTDI 

see 


100.0 

Reasoning 

SHT 




TTDI 

50.0 

Computers 

SHT 





100.0 

Circuits 




see 


100.0 

Tracking 





sec 

0.0 

Documentation 





MTSA 

0.0 


*Note : Items are identified by initials of first 4 significant words in titles, see Tables 1-3. 
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TABLE 7. Number of descriptors assigned by machine, also chosen 
by one or more human indexers 


Item 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

cccs 

1 

2 

3 

4 

4 

5 

6 

6 

7 

7 

7 

7 

ICC 

1 

2 

2 

3 

4 

5 

6 

7 

8 

8 

8 

8 

UCCT 

1 

2 

2 

3 

4 

5 

5 

5 

5 

5 

5 

5 

MCL 

1 

2 

3 

4 

5 

5 

5 

5 

5 

6 

6 

6 

WLI 

1 

1 

1 

1 

2 • 

2 

3 

3 

4 

4 

5 

5 

MCGE 

0 

1 

1 

2 

3 

4 

5 

5 

6 

7 

7 

7 

MTSA 

1 

2 

3 

4 

4 

4 

5 

5 

5 

5 

5 

5 

lAM 

1 

2 

2 

2 

3 

3 

3 

3 

3 

3 

4 

5 

LRBE 

0 

0 

1 

2 

3 

3 

4 

4 

4 

5 

5 

6 

ASDS 

1 

1 

2 

2 

2 

3 

4 

5 

5 

5 

5 

6 

CPLT 

1 

2 

3 

4 

5 

6 

6 

6 

6 

6 

6 

6 

SRPC 

0 

1 

2 

2 

2 

3 

3 

3 

3 

3 

4 

4 

RPSN 

0 

0 

0 

1 

1 

2 

2 

3 

3 

4 

4 

4 

PAL 

1 

2 

3 

4 

5 

5 

6 

6 

7 

7 

8 

8 

TTDI 

1 

2 

3 

3 

3 

4 

4 

4 

4 

5 

6 

7 

PMSO 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

HPFW 

1 

1 

1 

1 

1 

1 

2 

2 

2 

2 

2 

2 

SHT 

1 

2 

3 

4 

4 

4 

4 

5 

5 

6 

7 

7 

see 

1 

2 

3 

4 

4 

4 

4 

5 

5 

6 

7 

7 

CIOP 

0 

1 

2 

3 

4 

5 

6 

6 

6 

6 

6 

6 

Average Per-cent 
Chance that people 

75.0 

70.0 

66.7 

66.3 

64.0 

62.5 

60.7 

56.2 

52.8 

51.0 

49.1 

47.1 


will agree 


It also tends to produce a normalizing effect so 
that the various items in the collection are in- 
dexed more consistently than is possible if each 
item is processed as an entirely independent and 
self-sufficient entity. 

The word “training” as applied to our pro- 
posed method for automatic indexing is some- 
what of a misnomer in terms of our actual ex- 
periments to date. It is justified in the sense 
that a training sequence, using a “teaching 
sample,” gives the machine a representative set 
of “acceptable” input-to-desired-output ex- 
amples. It is not justified, as yet, in the sense 
that the system as presently programmed could 
significantly modify or adapt its procedures, 
without human intervention, to new vocabulary 


or subject emphases in the input items. There 
is reason to suppose, however, that the method 
can be adapted to new and changing vocabu- 
laries (both of “descriptors” and of author 
terminology) by throwing out for human in- 
spection : occurrences of words not on the gen- 
eral stop-list, yet not in the vocabulary; and 
occurrences of non-purged words in title-and- 
citations coinciding with the name of a descrip- 
tor, when that descriptor has not been assigned 
(e.g., an early occurrence of the word “plasma” 
in a physics paper, but not involving enough 
other words previously associated with blood 
chemistry, hematology, and the like, to assign 
physiological or medical field descriptors) . 

If this proves feasible, potentialities of 
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training the machine system to recognize new 
descriptor-assignment possibilities, to sharpen 
its own recognition-selection-assignment proc- 
esses in terms of current real-life input, and to 
recognize and display anomalies, changes, and 
trends, may supply significantly improved prob- 
lem-solving capabilities in the changing world 
of literature search as aided by machine. 
Although the SADSACT results to date are 
quite limited, the procedure indicates some 
promise of being quick, relatively inexpensive 
and consistent, and it capitalizes on one-time 
key stroking of descriptive cataloging informa- 
tion available for multiple purposes. 
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1. INTRODUCTORY REMARKS 

It is reasonable to suppose that information 
systems of the future will have control over 
portions of a document as well as over full text. 
In addition, we suppose that many users will 
want output in order to verify hypotheses or to 
form new hypotheses. This paper is concerned 
with the facilitation of the second procedure. 
Thus the emphasis is not on the two classical 
problems of information system.s, nam.ely, not 
retrieving relevant material, or retrieving ir- 
relevant material; but rather on the problem 
of assimilability of the output by the user. 

The particular aspect of the assimilability 
problem which will concern us is the effect of 
information loads of varying sizes upon the 
user’s ability to assimilate and correlate data. 
It seems plausible to assume that the user’s 
ability to correlate information is a function 
of the volume of information to be assimilated ; 
and moreover, that this ability will improve 
with increasing volume until a certain critical 
volume is reached, beyond which the load of 
information causes the user’s correlation ability 
to deteriorate rapidly. This assumption gives 
rise to two fundamental questions : 

1. What is this critical volume in a given 
situation? 

2. Is it possible to structure information in 
some way so as to improve the user’s 


ability to assimilate and correlate, thus 
extending the critical limit? 

Experiments performed in connection with 
an earlier project on information retrieval and 
correlation provided some interesting prelimi- 
nary answers to these questions. 

The library for this series of experiments 
consisted of a simulated intelligence file of sen- 
tence fragments developed from a book — 
'"Kogun — The History of the Japanese Army in 
the Pacific.” A machine-produced concordance 
of all words contained in the fragments served 
as an index to the file. 

The experiment of particular significance 
here is one which dealt specifically with the 
time dimension. All fragments containing any 
reference to time of day (for example, “noon,” 
“0845,” “dawn,” “8:00 a.m.”) were retrieved. 
About 25 fragment cards were retrieved in all. 
The fragments of this set had at least two at- 
tributes in common : they concerned the Pacific 
War in some way, and contained some reference 
to time of day. 

The randomly ordered cards were presented 
to several persons, who noticed only that all 
cards referred to battles in the Pacific War. 
When the cards were reordered into two groups 
of night versus day, however, a striking cor- 
relation emerged — namely, that all daylight at- 
tacks had been initiated by U.S. forces and all 


* The work reported in this article was supported by the National Aeronautics and Space Administration under 
Contract NASw-538. 
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night attacks by the Japanese. (This correct 
inference, incidentally, was nowhere explicitly 
stated in the book.) 

The results of this experiment suggest that 
the critical volume of information may in fact 
be extremely small. Only 25 cards were in- 
volved, and each card contained a single frag- 
ment, which in this case was a minimal piece 
of information. Moreover, these fragments had 
certain attributes in common — ‘‘Pacific War’’ 
and “time of day.” The correlation described 
above, however, was not discovered until the 
set of fragments was structured in terms of 
conceptual subgroups — “night, “day” — in- 
cluded in the semantic range represented by 
“time of day.” 

These findings suggest that structuring of 
data can extend the critical limit on volume, 
and further, that the elements of this structure 
should be subsets consisting of closely related 
members of the main set. The results of the 
earlier experiment thus established a point of 
departure for the present investigation. 

2. EXPERIMENTAL DESIGN 

A considerably more complex series of experi- 
ments was designed to test whether the 
“Kogun” discoveries provided essentially cor- 
rect answers to the two questions raised above. 
We decided to create a set of data structured 
along the lines described above — ^that is, a set 
whose structure is based on subsets consisting 
of closely related members of the main set. A 
second set of data more loosely structured than 
the first would be provided by the main set 
itself. A third set would consist of unstruc- 
tured data compiled randomly. The ability to 
assimilate and correlate the information pre- 
sented in these variously structured sets of 
data would then be tested by requiring experi- 
mental participants to list inferences suggested 
by each set. Based on the “Kogun” experiment, 
it was assumed that the first set of data, being 
the most highly structured, and therefore, most 
easily assimilated, would be most productive 
of inferences; and the random data, least pro- 
ductive. The scope of the initial experiments 
was limited to generating and testing the first 
set of data. 


The type of structure represented by the first 
set of data is most conveniently generated by 
“clustering” techniques. The object is to cluster 
the data in natural groupings whose members 
are associated through co-occurrence of con- 
cepts. The procedure is as follows : if two items 
of the data set have a concept in common they 
are said to be associated. A cluster is then de- 
fined as a collection of items which are mutually 
associated as well as being maximal with re- 
spect to this property. This last condition means 
that an item cannot be added to the collection 
without violating the property of mutual asso- 
ciation. The definition of association can be 
strengthened by stipulating that several con- 
cepts must occur in common. The technique 
used is a mathematical procedure for exhaus- 
tively listing these clusters. 

3. EXPERIMENTAL PROCEDURE 

Materials to be used for the creation of the 
three data sets were a fragment file of sentences 
generated from a collection of documents on 
advanced propellants and a machine-produced 
concordance which served as an index to the file. 

The first step in the experimental procedure 
consisted of selecting a single fragment called 
the Fi fragment, which would serve as the 


f=5 



Figure 1. Illustration of F^ star. 
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nucleus for a set consisting of all fragments 
related to the Fi fragment through co-occur- 
rence of concepts. This is diagrammed in Fig- 
ure 1. The lines between fragments indicate the 
co-occurrence of concepts; for example, if F^ 
contains the words ‘‘small motors” and F 2 con- 
tains the word “micro-burner,” then the same 
concept can be said to occur. The set so formed 
is called the Fi “star.” Clusters of fragments 
associated by co-occurrence of concepts in addi- 
tion to the Fi defining concepts can then be 
derived from the Fi star. 

As an illustration, in Figure 1 the set {F^, F 2 , 
F2} is a cluster, so are {Fi, F4}, {F,, F5}, {Fj, 
F«}. These clusters can serve as the maximally 
structured set of data for the experiments, 
while the members of the Fi star would consti- 
tute the loosely structured set. 

Since the magnitude of the task of develop- 
ing these structures manually is directly pro- 
portional to the number of fragments involved, 
it is essential that the membership of the star 
should not be too large. On the other hand, if 
its membership is too restricted, the star will 
not contain enough information for valid gen- 
eralizations to be made. As the membership of 
the Fi star depends on the number of concepts 
contained in the Fi fragment and their distri- 
bution throughout the fragment file, it can be 
seen that selection of the Fi fragment is the 
crucial step in the experimental procedure. 

Attempts to limit the size of such stars by 
arbitrarily restricting the number of fragments 
taken from each document proved unsuccessful, 
as did several other artificial means of coping 
with an oversupply of related fragments. Many 
potential Fj fragments were selected, found in- 
appropriate, and rejected before the following 
Fi fragment was chosen : 

“Space mission studies have indicated the 
need for small rocket motors of low thrust 
(of the order of 100 lb.) for attitude con- 
trol, midcourse trajectory corrections, ren- 
dezvous, and so forth.” 

Concepts used to define the Fi star were as 
follows : 

small and motors or micro and motors 
burners burners 

engines engines 


space 

trajectory 

control and attitude 

rendezvous 

All cards containing these concepts were then 
retrieved from the file. There were 24 in all, 
yielding an Fi star of 24 members, which 
proved to be a workable amount. 

A list of concepts was then developed for 
each of these fragments and the list for each 
fragment was compared with that of all other 
fragments in the Fi star to determine which 
fragments had the greatest number of concepts 
in common. A correlation matrix was created 
in which fragments having two or more con- 
cepts in common were labeled as strongly cor- 
related, those having only one common concept 
as weakly correlated. “Strong” and “weak” 
clusters were then derived from the matrix by 
the mathematical technique mentioned above. 

Since a greater degree of relatedness or 
structure is required by the defining criteria for 
strong versus weak clusters, these clusters 
represent legitimate materials for testing the 
user’s ability to assimilate and correlate infor- 
mation having various degrees of structure. It 
was expected that inferences derived from 
strong clusters would exhibit some distinctive 
property as compared to weak clusters. 

A further test of assimilation and correlation 
ability could be achieved by imbedding the 
clusters in successively larger segments of text, 
on the theory that correlations obvious in the 
clusters as represented by the fragment cards 
would be obscured if the clusters were instead 
represented by the documents from which they 
were culled. We therefore selected 12 clusters 
— 6 strong and 6 weak — which were repre- 
sented by four levels of material : 

Level Description 

D Cluster is represented by the fragment 
cards. 

C Cluster is represented by the sentence on 
the fragment card plus the six sentences 
(3 preceding, 3 following) which form 
its immediate context within the docu- 
ment. 
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B Cluster is represented by a machine-pro- 
duced extract of the document. 

A Cluster is represented by the document 
itself. 

The clusters were then divided into four 
groups of three clusters each. The material for 
level A of the first group of clusters was placed 
in a folder marked for the first participant 
along with the level B material for the second 
group of clusters, the level C material for the 
third, and the level D material for the fourth, 
so that none of the four participants (specialists 
in the field of advanced propellants) would have 
the opportunity to draw inferences from the 
same cluster on two levels. These elaborate pre- 
cautions were to prevent the participants from 
becoming familiar with the same cluster 
through its representations on various levels, 
causing a tendency to make the same inference 
for the different levels rather than looking for 
new correlations which might be exposed in 
proceeding to a level where the cluster is less 
deeply imbedded in peripheral material. The 
participants were instructed to begin with level 
A and proceed to level D, observing the time 
limits (given below) imposed for reading the 
material and making inferences : 

Table of Maximum Times 


Time for Deriving 

Level Reading Time Inferences 


A 

B 

C 

D 


One-half hour 
20 minutes 
15 minutes 
5 minutes 


20 minutes 
20 minutes 
20 minutes 
20 minutes 


4. RESULTS 

A total of 152 inferences were listed by the 
participants. Of these, exactly half were de- 
rived from strong clusters and half from weak. 
The level from which the largest number of 
inferences was derived was the A level, which 
also contained the most material. 51 inferences 
were listed for the A level, 42 for the B level, 
34 for the C level and 25 for the D level, indi- 
cating a strong correlation between quantity of 
material and number of inferences produced, 
which we had intended to offset to some extent 
by imposing time limits for reading and infer- 


cen'ce listing. The S- 16 cluster (representing 
level D) and the inferences derived from it are 
listed below. 

S-16 Cluster {Level D) 

This motor uses small quantities of propel- 
lants which can be made on a laboratory 
scale yet is large enough that the heat 
transfer losses are small. 

The combustion gases are cooled to a much 
greater extent relative to that in a larger 
motor, thus lowering c (characteristic 
velocity) . 

Space mission studies have indicated the 
need for small rocket motors of low thrust 
(of the order of 100 lb.) for attitude con- 
trol, midcourse trajectory corrections, ren- 
dezvous, and so forth. 

Inferences for Cluster S-1 6 

1. Inferences at A~level 

1. Propellant evaluations do not apear to be 
standardized. 

2. Inferences at B4evel 

1. Hypergolic rocket propellants permit 
optimum utility of small thrust engines. 

2. Injector configuration is a critical pa- 
rameter in achieving maximum perform- 
ance of small, low thrust rocket engines. 

3. High energy, toxic rocket propellants 
can be handled without hazard. 

3. Inferences at C-level 

1. Propellants can deteriorate upon stand- 
ing over long periods of time. 

2. Impurities may be the cause of uncon- 
trolled burning and/or detonations and 
decrease performance of propellants. 

3. In small motors heat transfer through 
components lowers performance signifi- 
cantly. 

4. Better mixing of propellants increases 
performance. 

5. N2O2 and N2H4 are the best state-of-the- 
art combinations which satisfy require- 
ments. 

6. Testing on small sub-scale is sufficient 
for accepting or rejecting a propellant 
combination for use on larger motor 
tests. 
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4, Inferences at D4evel 

1. Low thrust, efficient, simple rocket 
motors have a variety of potential re- 
search and development applications. 

2. In a small motor, combustion gases are 
cooled to a greater extent than in a 
large motor probably because of the 
greater ratio of heat transfer surface to 
combustion chamber volume in the small 
motor. 


The inferences were then uniquely classified 
into the following categories : 


Propulsion Systems 
Propellant Combina- 
tions 

Propellant Evalua- 
tions 

Reliability 

Performance 

Design 

Chambers 

Storage and Handling 


Combustion Efficiency 
Combustion Stability 
Ignition 
Heat Transfer 
Problem Areas 
Extrapolation of Test 
Results 

Applications for Small 
Motors 


The inferences derived on the various levels 
were compared to determine whether the classes 
of inferences had members from all levels; or 
whether, as we had postulated, the relations 
between members of a cluster would be ob- 
scured when the cluster was represented by 
large segments of data, so that the same type 
of inference could not be derived from all repre- 
sentations of the same cluster. This assumption 
proved false in the present investigation, which 
might be attributed to the fact that the critical 
volume was not reached due to the overgenerous 
time limits. The same types of inference were 
frequently listed for levels A through D, 


The original fragment cards making up the 
clusters were then assigned to as many of the 
above categories as were applicable in order to 
compare the extension of the clusters with that 
of the inferences derived from them. Intersec- 
tions of categories were plotted for the clusters 
and for inferences made at the D level — i.e., 
those derived directly from the clusters them- 
selves. A correlation coefficient was used to 
determine the degree of ‘‘overlap’' between 
clusters and inferences, thus measuring their 
conceptual closeness. Although the correlation 


coefficients for both strong and weak clusters 
were good, inferences derived from strong 
clusters proved significantly more highly cor- 
related with their parent clusters than did 
inferences derived from weak clusters. The 
strong clusters scored .61, the weak ones .43, 
where the coefficient is on a scale from — 1 to 
-fl with 0 being the point of random correla- 
tion. 

Although these data support to some extent 
the trends indicated by the “Kogun” experi- 
ments, they also raise some interesting ques- 
tions which can only be answered by compara- 
tive data of the type outlined above in describ- 
ing our design for these experiments. One of 
these questions arises from the fact that the 
participants were able to recognize relations 
between members of a cluster and make infer- 
ences based on these relations even when the 
cluster was represented by large segments of 
text. Would the same apply for more loosely 
structured sets of data — if the 24 members of 
the Fi star, for example, were imbedded in suc- 
cessively larger segments of text, would the 
basic relation still be discernible, or would 
inferences derived be diffuse and genera! rather 
than highly correlated with the star from which 
they were derived? What sort of inferences 
would be derived by applying the layering tech- 
nique to a random selection of fragments re- 
lated only by coming from literature in the same 
field? 

A second type of question is more thought- 
provoking, and less readily answerable. It de- 
parts from the notion of an inference as a 
measure of the ability to assimilate and corre- 
late information and considers instead the na- 
ture of an inference. We have discussed the 
correlation betwen basic data and inferences 
derived from these data, but is it reasonable to 
expect such correlation to exist? Perhaps the 
mark of a good inference is that it exhibits 
little conceptual relatedness to the data from 
which it was derived. Again, what is a “good” 
inference? Should “good” be interpreted as 
“non-trivial”? How much information should 
be carried in an inference? A systematic ap- 
proach to these problems is explored in the 
following section. 
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5. QUANTITATIVE EVALUATION OF IN- 
FERENCES 

In developing a systematic procedure for 
evaluating inferences, it is convenient to think 
of the presented information set S as having 
three components : information content, volume, 
and structure. Let “s” denote the proposition 
expressed by the content of the set, and suppose 
that the analyst, after examining the set, pro- 
duces the inferences (hypotheses, conjectures) 
hi, , hn. By varying the structure, or volume, 
or content of S, different collections of infer- 
ences will be generated. The problem is then to 
develop a comparative evaluation of the ''good- 
ness” of the two collections. In order to do this 
we consider a quantitative evaluation based on 
the notion of a subjective probability function 
or credibility function or betting function. Such 
functions are discussed in References 1, 2, 3. 
We denote this function by "B” (for “betting 
function” or “belief”). Thus B(h, s) is the 
analyst's evaluation of the degree of belief in 
the hypothesis h after considering the evidence 
s. B(h) is his degree of belief in the hypoth- 
esis h prior to considerings. We suppose that 
B obeys the laws of probability so that B gives 
the value of the betting quotient (ratio of 
amount bet to total stake) that should be offered 
for a bet on the hypothesis in question.* In 
addition, the quotient should be psychologically 
fair in the sense that the bettor must alternately 
be willing to bet on the negation of the hypoth- 
esis with the complementary quotient. 

Let us suppose for the present that we actu- 
ally can obtain three fundamental values B(h), 
B(h, s), and B(s). Since the “goodness” of a 
collection of inferences should in some way 
depend on their validity, novelty, dependency 
on s, and non-triviality, let us see how these 
things can be defined in terms of the B-func- 
tion. 

It seems appropriate to define the novelty of 
h to be B(Tr), where h is the negation of h. 
Thus the novelty of h is taken to be 

B(h) =1 — B(h) 

i.e., the complement of the degree of belief in 
the inference before examining the supporting 

*Thus betting schemes based on B will not lead to a 
net loss in every possible case. (See Reference 1, 3.) 


data. We define the validity of the inference to 
be B (h, s) ; i.e., the degree of belief after exam- 
ining the supporting data. It appears that the 
“score” of an inference should increase with 
increasing validity and novelty and that the 
product of validity and novelty would therefore 
be a good choice for the score. On the other 
hand, an inference quite independent of s 
should score zero. Independence is character- 
ized by the condition 

B(h, s) ^ B(h) 

so that the product of validity and novelty for 
an independent inference is B(h). B(h). Thus, 
we propose to score inferences as the product of 
validity and novelty less the independence value 
of this product. That is to say, we take the final 
score to be 

G(h,s) [B(h,s)— B(h)] •B(h) 

The goodness of the inference is therefore 
measured as the product of the change in degree 
of belief and the novelty. 

Let us now test this scoring schema for vari- 
ous extreme conditions: (1) h is tautologous 
(e.g., h is “A is A”) ; (2) h is contradictory 
(e,g., h is “A is not A”) ; (3) h is trivial (e.g., 
h — s) ; (4) s logically implies h (deductive 
inference) ; (5) h is independent of s (the sup- 
porting data does not relate to h). These five 
cases are shown in the table below. As well as 
showing the notions of validity and novelty, we 
also include the notions of “strength of sup- 
porting data” and “relative validity.” 

We now examine how the procedure would 
apply to some actual data. We use the “Kogun” 
experiment discussed above. The hypotheses to 
be considered are: (hx) Japanese attack at 
night in a particular case; (ho) Americans at- 
tack in day in a particular case; (h^) Japanese 
almost always attack at night; (h 4 ) Americans 
almost always attack in day: (hr,) h^ and h 4 
(i.e., the assertion of the conjunction of ha and 
h 4 ) . The supporting data s is that given by the 
25 fragment cards. 

Since the source of the data is from the book 
“Kogun” and consists of supposedly factual 
reports, let us assign a rather high value to 
B (s) , say 0.9. Disregarding s, we consider each 
of the hypotheses hj and h 2 as quite plausible 



TABLE OF EXTREME CONDITIONS 


Notion 

Definition 

h is 

tautologous 

h is 

contradictory 

h is trivial 
(h = s) 

s logically 
implies h 

h is independent 
of 8 

strength of 

Supporting Data 

B(s) 

B(s) 

B(s) 

B(s) 

B(s) 

B(s) 

Relative Validity 

B (s and h) 

B(s) 

0 

B(s) 

B(s) 

B(s) B(h) 

Novelty 

B(h) 

0 

1 

B(s) 

B(E) 

B(h) 

Validity 

B(h, s 

1 

0 

1 

1 

B(h) 

(Novelty) x (Validity) 

B(h,s)B(h) 

0 

0 

B(i) 

B(E) 

B(h) B(E) 

Score 

[B(h,s) — B(h)] B(¥) 

0 

0 

B='(i) 

BMh) 

0 
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and, therefore, assign a high a priori degree of 
belief to both, say 0.9. Thus, the novelty of each 
is 0.1. The analysis of the hypotheses h^ and h 4 
is more subtle. Presuming we know nothing 
about the Pacific War, we imagine that perhaps 
the specific military situation would mean that 
sometimes night is better for attack, sometimes 
day. On the other hand, perhaps the general 
properties of the military situations of the kind 
discussed are such that night is always better 
or day is always better for an attack. This 
second consideration makes us think that B (ha) 
is near 0.5 (because we cannot choose between 
night and day). The first consideration makes 
us think that B(hH) is less than 0.5. Therefore, 
we take B(ha) = 0.4. Similarly B(h 4 ) = 0.4. 


The hypothesis hr., however, is completely sur- 
prising, so we take B(hr.) = 0.1. 

To estimate the validity, we note that hi and 
h 2 are contained in s. Thus, B (hi, s) = B (h 2 , s) 
= 1. For ha and h 4 , we note that s expresses 
frequency of occurrence of certain situations 
and that the relative frequency of occurrence of 
these situations as described in ha and h 4 is 
unity. Thus, we take B(h 3 , s) =: B(h 4 , s) = 

0.9 (slightly less than unity). Similarly, we 
take B (h 5 , s) close to 0.9 but somewhat less, say 
9.8.* The relative validity and scores can then 
be computed. The results are shown in the 
table belPw and are seen to be intuitively satis- 
factory. 


TABLE OF KOGAN INFERENCES 


Notion 

Definition 

hi 

h2 

ha 

hi 

h5 

Strength of supporting 
data 

B(s) 

.9 

.9 

.9 

.9 

.9 

Relative Validity 

B (s and h) 

.9 

.9 

.81 

.81 

.72 

Novelty 

B(h) 

.1 

.1 

.6 

.6 

.9 

Validity 

B(h,s) 

1.0 

1.0 

.9 

.9 

.8 

(Novelty) x 
(Validity) 

B (h, s) • B (h) 

.1 

.1 

.54 

.54 

.72 

Score 

[B(h,s)— B(h)]B(h) 

.01 

.01 

.30 

.30 

.63 


The problem of obtaining the B values is still 
open and it should be approached keeping the 
following considerations in mind. 

1. The inferences obtained often contain 
words indicating qualitatively the degree of 
belief (e.g., ‘fit appears that,’' “possibly,” “prob- 
ably,” etc.) ; there is evidence that there is a 
comparative notion of belief susceptible of 
quantization. 

2. There may be difficulty in obtaining an 
honest appraisal of B(h) after the analyst has 
examined the data s. Possibly this can be 
avoided by using a referee to make the ap- 
praisals. 

*The normative condition, B(hy & h^, s) = BChg, 
h 4 & s). B(h 4 , s), is considered in this appraisal. 


3. Comparative evaluations are desired and 
the quantitative evaluations are for this pur- 
pose. Thus consistency in the evaluations is 
required only to the extent of the comparative 
results. 

6. CONCLUDING REMARKS 

In this study, it was assumed : 

a. that a user requirement in certain infor- 
mation systems is the obtaining of data to 
form hypotheses 

b. that this ability to form hypotheses is 
dependent upon the assimilation and correla- 
tion of the data presented 

c. that the ability to assimilate and corre- 
late information is in turn dependent upon 
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the volume and structure of the information 
presented. 

A measure of assimilability called the critical 
volume was postulated based on the results of 
earlier information correlation experiments. 
An experimental design was developed to deter- 
mine the relation between this measure and the 
degree of structure in the data presented. A 
mathematical technique was used to produce 
two sets of structured data. Assimilation of 
information represented by these data was then 
tested by requiring experimental participants 
to derive inferences from the data presented. 
Inferences derived from more highly structured 
materials proved more highly correlated with 
the basic data set. 

Thus in the course of these experiments, 
methods for systematically structuring data 
and measuring assimilability were designed and 
tested on a small scale. In addition, a procedure 


for quantitatively evaluating hypotheses (i.e., 
the procedure for measuring assimilability) is 
proposed as a tool for a feedback improvement 
to the structuring procedure. Results of these 
experiments, although preliminary, are gen- 
erally positive, and suggest realistic approaches 
to further studies of information correlation. 
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INTRODUCTION 

The comparison between stored informa- 
tion identifications and requests for informa- 
tion is one of the principal tasks to be per- 
formed in automatic information retrieval. In 
so-called descriptor systems, where information 
is represented by sets of independent key words, 
this operation is relatively simple, since it con- 
sists of a comparison between the respective 
‘"vertors” of key words. In many retrieval sys- 
tems it has been found necessary or expedient 
to use more complicated constructs for the iden- 
tification of information. Notably '"role’' indi- 
cators are often added to identify various types 
of key words, and "links” specify a variety of 
relations between key words. A complete identi- 
fication for a document or an item of informa- 
tion is often represented by a graph, consisting 
of nodes and branches between nodes, to iden- 
tify respectively the key words and relations be- 
tween key words.i*2,3 matching of such 

information graphs with graphs representing 
requests for information is a relatively compli- 
cated and time consuming operation, particu- 
larly since the request structure can be made 
to match the information structure only par- 
tially and incompletely. 

The graph matching problem arises also in 
document retrieval systems where certain sig- 
nificant portions of text are extracted and com- 
pared with the search requests. In such cases, 


it is possible to represent the syntactic struc- 
ture of ^he text excerpts by abstract trees, and 
a tree matching procedure becomes necessary 
to compare the extracted information with the 
requests.^’® As before, an exact matching proce- 
dure would not be very helpful, since many dif- 
ferent ways can be found to express the same 
ideas or requests. What is needed instead, is a 
procedure which permits inclusion of partly un- 
specified information, and which provides for 
the possible relaxation of the various conditions 
that render a complete match impossible at any 
given time. Graph matching techniques are, of 
course, also applicable to the comparison of 
items of information which exhibit inherently 
a multi-dimensional structure, such as electrical 
or pipeline networks, street or geographical 
maps, chemical molecular structures, and so on. 

Extensive experience has been gained in the 
past with structure matching programs which 
operate on a "node-by-node” or a “piece-by- 
piece” basis.^''^'* In the node-by-node approach, 
the nodes of the two structures are compared 
one at a time, until either the complete struc- 
tures match, or else an incompatibility arises; 
in the latter case it becomes necessary to back- 
track to a point where there is agreement and 
try again with different elements. In the piece- 
by-piece approach, a dictionary of basic sub- 
structures is used to break a given structure 
into pieces which are then matched as a whole. 


* This study was supported in part by the National Science Foundation under Grant GN-82. 
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Neither of the two techniques works well for 
any but the simplest structures. The node-by- 
node method usually requires extensive back- 
tracking, involving the comparison of many 
hundreds of nodes for even very simple struc- 
tures. The piece-by-piece approach, on the 
other hand, suffers from the fact that no stand- 
ard, well-defined method exists for breaking a 
given structure into substructures. 

A topological structure-matching procedure 
has been programmed for the 7090 computer 
which does not depend on a specific ordering of 
the nodes, nor on the presence or absence of 
certain specified substructures. Little or no 
backtracking is required, and the method can 
be used to detect complete as well as partial 
matches. The basic idea is to determine certain 
simple properties of the nodes of the two struc- 
tures to be matched, and to equate, those subsets 
of the nodes in the two structures which exhibit 
equivalent properties. A standard procedure is 
then used to form new matching subsets, and 
to break down already existing subsets into 
sets with fewer members. The procedure is 
completely determinate except in cases where 
it is necessary to resolve certain symmetries in 
the connection pattern of the nodes; in that 
case a guess (assignment) is made as to the 
correct solution; such a guess may later prove 
to have been right or wrong, and if wrong, may 
require some backtracking. In most practical 
problems, however, little backtracking seems 
to be needed.t Computer experiments indicate 
that the topological procedure is much more 
efficient than either the ‘‘node-by-node'' or the 
“piece-by-piece" approach. Operating auto- 
matic retrieval systems based on the use of 
relatively complex structures (as opposed to 
sets of unconnected key words) seem therefore 
to become a practical possibility instead of 
merely a theoretically desirable goal. 

An example is given first to illustrate the 
partial matching procedure, as well as the 
methods which may be used to alter one or both 
of the structures to be compared in order to 
make a match between them more likely. The 
procedure is then applied to the matching of 

t A related strategy has been used by Unger to detect 
complete, rather than partial, isomorphisms between 
directed graphs.9 


document graphs with request graphs, and to 
a retrieval system based on the comparison of 
syntactically analyzed document excerpts with 
a stored phrase dictionary. 

THE STRUCTURE MATCHING 
PROCEDUREt 

Consider first the problem of determining 
whether the graph of Fig. 1(b) is contained in 
the graph of Fig. 1(a), that is whether Fig. 
1 (b) is a subgraph § of Fig. 1 (a) . The nodes 
in the two structures are labelled arbitrarily 
from ® to (5) and from (a) to (c) respectively, 
and the connection pattern of the nodes is rep- 
resented by the binary connection matrices! 
shown in Figs. 2(a) and 2(b). Since no addi- 
tional information is furnished about either 
the nodes or the branches of the two graphs 
under consideration, all relevant properties of 
these graphs are in fact derivable from the con- 

O ® @ 



Figure 1. Directed Graphs. 


t Theoretical foundations and proofs of convergence 
of the method are given in detail in Reference 10. The 
theory as well as applications to chemistry are also 
more fully treated in References 11 and 12. 

i A graph G consists of a set X (the nodes) and a 
set of relations between certain pairs of nodes (the 
branches). A matrix C such that C/ = 1 whenever 
there is a branch from node Xj to node Xj, and is 0 
otherwise, is called the connection matrix of graph G. 
A subgraph H of G is obtained by removing from G 
certain nodes as well as all branches adjacent to the 
removed nodes. A partial graph J of G is obtained by 
removing from G some of its branches. A partial sub- 
graph K of G is a subgraph of a partial graph of G, 
A completed partial subgraph L of G is a partial sub- 
graph to which branches are added so as to preserve 
all original paths between the nodes included in L; 
specifically, if node z is removed from G in forming L 
and if there exist paths from x to z and from z to y in 
G, then a path exists from x to y in L for all x, y 
included in L. 
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Figure 2. Connection Matrices for Graphs of Figure 1. 


Set 

Number 

Criterion for Set Formation 

Corresponding Sets 

XIX' 

Assignment 

{a}={3} 

XX' 

Outward connections, sets XIX' 

{b,c}={2,5} 

XXI' 

' Partition 

{a)={3} 

XXII' 


{b}={2} 

XXIII' 


{c}= - 


Figure 3b. Set Correspondences Resulting from 
Improper Assignment. 


nection matrices. The computer program is 
therefore based on the manipulation of binary 
matrices of the type shown in Fig. 2. 

The heart of the algorithm consists in using 
various properties of the nodes and/ or branches 
of the graphs in order to generate pairs of sets 
which must match if the two graphs are even- 
tually to match. The following properties are 
particularly useful for this purpose : 

a. the order outward (or inward) degree 
of the nodes, that is the number of nodes 


Set 

Number 

Criterion for Set Formation 

Corresponding Sets 

I 

Outward degree 1 

{b} c{i, 2,3,4, 6, 8,9} 

11 

Outward degree 2 

{a} £{2,3,4, 9} 

III 

Inward degree 1 

{b} £{2,4,5, 6,7,8} 

IV 

Inward degree 2 

{c}£{2,5,7,8} 

V 

Partition 

{a} £ {2,3,4,9} 

VI 


{b}£ (2, 4,6,8} 

VII 


{c} £ {2,5,7,8} 

VIII 

Outward connection, sets V 

{b,c) £ (2,4,5,6,7,8} 

IX 

sets VI 

{c} £(2,6, 7, 8} 

X 

Inward connection, sets VI 

{a}£ {1,2,3,4,9) 

XI 

sets VII 

{a,b}£ {1,2,3,4, 6,8,9} 

XII 

Partition 

{a}£ {2, 3,4,9} 

XIII 


{b}£ {2,4,6, 8} 

XIV 


{c}c {2,7,8} 

XV 

Inward connection, sets XIV 

{a,b}c {1,2,3, 4,6,8, 9} 

XVI 

Partition 

{a} c {2A4,9} 

XVII 


{b}c {2,4,6,8} 

XVIII 


{c) £ {2,7,8} 

XIX 

Assignment 

{a}={2} 

XX 

Outward connection, s^ XIX 

{b,c}c {6,7,8} 

XXI 

Partition 

{a}={2} 

XXII 


|b}E{6,8} 

XXIII 


{c}c{7,8} 


Figure 3a. Correspondences Formed for the Graphs 
of Figure 1. 


reachable from a given node by outgoing 
(or incoming) paths of length k; 

b. labels or identifiers which may be associ- 
ated with nodes or branches ; 

c. the connectivity patterns of sets of nodes, 
that is, the nodes reachable from a given 
set of nodes by paths of length k. 

The procedure for the graphs of Fig. 1 is out- 
lined in Fig. 3. Only connections of length 1 
have been used to simplify the exposition. 

The initial set correspondences are shown in 
Fig. 3(a), lines I to IV. Set II, for example, is 
constructed by noting that the set of all nodes 
of outward degree 2 in the Query structure, 
must correspond to the set of all nodes having 
at least outward degree 2 in the dictionary 
structure. The only node of outward degree 2 
in Fig. 1(b) is @ ; there are four nodes in 
Fig. 1(a) that have outward degree 2 or 
greater ; @ must therefore correspond to either 
nodes ( 2 ), @ or 

At this point it is necessary to generate 
smaller sets from the ones shown on lines I to 
IV of Fig. 8(a). This is done by noting, for 
example, that the set of query nodes contained 
in both sets I and III of Fig. 3(a) can corre- 
spond only to dictionary nodes which are also 
contained in sets I and III (plus possibly in 
other sets) . The only query node contained in 
both sets I and III is ® ; in the dictionary 
structure, nodes ( 2 ), ®, ®, and ® are both in 
sets I and III, so that set {b} must be contained 
in sets (2, 4, 6, 8}. The set of possible corre- 
spondents of node ® has then been reduced 
from the seven nodes of set I to the four nodes 
of set VI. This ‘'set partitioning'' procedure is 
performed by the computer by comparing the 
columns of the binary matrices exhibited in 
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SETS 

NODES 1 

a 

b 

c 

1 

2 

3 

4 

5 

6 

7 

8 

9 

I 

0 

1 

0 

1 

1 

1 

1 

0 

1 

0 

1 

1 

n 

1 

0 

0 

0 

1 

1 

1 

0 

0 

0 

0 

1 

HI 

0 

1 

0 

0 

1 

0 

1 

1 

1 

1 

1 

0 

i!L_ 

0 

0 

1 

0 

1 

0 

0 

1 

0 

1 

1 

0 


Figure 4. Matrix Representation of Sets 
I-IV of Figure 3a. 

Fig. 4. Node® , for example, has column vector 
1010 ; the only nodes of the dictionary structure 
including the pattern 1010 are nodes ( 2 ), ©, ® 
and ® with vectors 1111, 1110, 1010, 1011 re- 
spectively. 

The partitioning process yields three new sets 
labelled V, VI and VII. New sets (VIII to XI) 
are added using outward and inward connec- 
tions of length 1 from the set^ V, VI, and VII. 
(The outward connection of the set {c} is 
empty, and therefore it is not included in the 
table.) The partitioning process is repeated, 
yielding sets XII, XIII, and XIV. Since the 
possible correspondents of node © have 
changed, it is not redundant to test the con- 
nectivity again. When this is done and another 
partition performed, sets XVI, XVII, and 
XVIII result. These pairs of sets are identical 
to those produced by the previous partitioning, 
and no simple properties of the nodes can be 
used at this point to generate new sets which 
would in turn result in a reduced partition.il 
An “assignment'' is therefore made by postu- 
lating the correspondent for node (® (set XIX) . 
This assignment represents a guess which must 
later be verified for correctness. Partitioning 
of the sets XVI to XX yields the sets XXI to 
XXIII. New assignments (not shown in Fig. 
3(a) ) of ® first to ® and then to ® then pro- 
duce two one-to-one correspondences between 
the graphs of Fig. 1 : 


(D 

© 


© 

© 

© 


and i 


r® 

Q 


® 

® 


If it is desired to obtain other possible cor- 


II The problem of choosing a ‘*good” property set to 
be used for the generation of set correspondences is, in 
general, unsolved. For the example at hand, additional 
properties might, however, have been used. For in- 
stance, second order inward degrees yield the corre- 
spondence c — 2, 6, 7 after the first partition. 


respondences, it is now necessary to go back 
to the sets XVI to XVIII and attempt other as- 
signments for node @ . Sets XIX' to XXIIT of 
Fig. 3(b) illustrate the assignment® ®. 
This assignment yields a partition which is seen 
to be improper since node © cannot be included 
in the empty set. The assignment a @ is 
therefore not useful since it leads to an in- 
compatibility. The other two possible assign- 
ments for node® , do, however, furnish accept- 
able one-to-one correspondences as follows: 


© 

© 


© 

© 


and J 


© 

© 


'© 

®. 


^he four mappings obtained are easily verified 


A flowchart of the complete procedure is 
shown in Fig. 5. The procedure is seen to be 



CARDINALITY VIOLATION 


Figure 5. Simplified Diagram of Graph 
Matching Procedure. 


iterative since the generation of corresponding 
sets is followed by a partitioning process, fol- 
lowed again by the generation of new sets, and 
so on. Alternate applications of partitioning 
followed by formation of new sets will result 
in one of three situations : 

1. the membership of each set is reduced to 
one, thus exhibiting the complete match 
between the given structures ; 

2. an incompatibility arises between pairs of 
corresponding sets, that is, a cardinality 
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violation is found to exist between pairs 
of corresponding sets; in that case no 
match exists in general ; 

3. no incompatibility arises but repeated ap- 
plication of the partitioning procedure 
will not result in the formation of new 
sets ; in that case, more than one match 
is generally possible, and it is necessary 
to perform an arbitrary assignment of 
correspondents for one of the nodes. 

If a cardinality violation is detected, that is, 
if, for example, a set A is found to be included 
in a set B which has fewer members, as hap- 
pened in the example for sets XXIIF, then the 
two structures being compared obviously can- 
not match. The comparison process can there- 
fore be stopped immediately, unless the incom- 
patibility resulted from a previous assignment; 
in the latter case, only that particular assign- 
ment can be discarded, and other possible as- 
signments must be tried before deciding that 
the two structures do or do not match. The 
procedure to be followed in case of cardinality 
violation is shown by broken lines in Fig. 5. 
In practice, cardinality violations normally 
arise early for graphs which do not match, so 
that the procedure is very rapid in such cases.# 

THE ADAPTIVE MATCHING PROCESS 

In the example described in the preceding 
section, four isomorphisms were detected be- 
tween the graph of Fig. 1(b) and that of Fig. 
1(a). Clearly, it is possible to increase or de- 
crease the number of matches (or, alterna- 
tively, to increase or decrease the probability 
of a match between any two given structures) 
by suitably relaxing or tightening the condi- 
tions which affect the matching process. If, for 
example, the unilateral connections (directed 
branches) in Fig. 1 are replaced by bilateral 
connections, and therefore the non-symmetric 
connection matrices of Fig. 2 are changed into 
the symmetric ones of Fig. 6, then eight addi- 
tional isomorphisms will be found between the 
two graphs. In fact, three different isomor- 
phisms will then exist between the “triangle” 
(a, b, c} and each of the triangles {2, 6, 7}, 

# Non-matching graphs of fifty nodes required an 
average of less than one-half millisecond on the 7090 
computer during a test run. 
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a) DICTIONARY STRUCTURE b) QUERY STRUCTURE 

Figure 6. Symmetric Connection Matrices Derived 
from Graphs of Figure 1. 


{2, 8, 7}, {4, 2, 8} and {9, 4, 8} included in the 
dictionary structure. 

Another possible way of relaxing the condi- 
tions which are operative during the matching 
process is to permit the introduction between 
any two nodes in the query structure of a varia- 
ble number of intermediate nodes. This process 
replaces the query structure of Fig. 1(b) by the 
new structures of Figs. 7(b) and 7(d) , (The 
broken lines indicate indirect connections.) 
Since each of the intermediate nodes may or 
may not match a given node in the dictionary 
structure, it is now necessary to test whether 
the query structure is a completed partial sub- 
graph (rather than a subgraph) of the diction- 
ary structure. A comparison of Fig. 7(b) with 
the partial subgraph of Fig. 1(a) represented 
as Fig. 7(a), and a comparison of Fig. 7(d) 
with Fig. 7 (c) reveals at least two additional 
completed partial subgraph matches that could 
be obtained in addition to the four subgraph 
matches already exhibited in the preceding sec- 
tion. Further completed partial subgraph 
matches not shown in Fig. 7 are also possible. 



o) PARTIAL SUBGRAPH OF b) MODIFIED QUERY 

DICTIONARY STRUCTURE OF STRUCTURE 

FIGURE 1 



c) PARTIAL SUBGRAPH OF d) MODIFIED QUERY 

DICTIONARY STRUCTURE OF STRUCTURE 

FIGURE 1 


Figure 7. Matching Partial Subgraphs. 
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b) QUERY STRUCTURE 


Figure 8. Path Matrices (Including Indirect 
Connections) for Graphs of Figure 1. 


To determine whether a graph is a completed 
partial* subgraph of another graph, it is no 
longer sufficient to know whether two nodes 
are directly connected or not, but it is also nec- 
essary to know whether a (possibly indirect) 
path exists between any pair of nodes. Thus 
the connection matrices of Fig. 1 must be re- 
placed by the “path matrices"’ shown in Fig. 8 
in which the i-j^^' element is 1 whenever a path 
exists from node i and to node j.** Each 1 in 
the matrices of Fig. 8 thus indicates either a 
direct or an indirect connection between the 
corresponding nodes, and use of the algorithm 
of Section 2 with the path matrices of Fig. 8 
(instead of the connection matrices of Fig. 2) 
will generate the isomorphisms derived in Figs. 
3 as well as a number of additional completed 
partial subgraph matches including those ex- 
hibited in Fig. 7. 

Consider now, on the other hand, methods 
which will tighten the requirements to be met 
for a satisfactory match. Instead of specifying 
less information than for the directed subgraph 
comparison, it is now necessary to add restric- 
tions to the graph of Fig. 1. A possible method 
consists in adding labels to the unlabelled 
branches of the graph to simulate, for example, 
various types of relations between the nodes. 
Another possibility is the addition of labels to 
the nodes of the graph so as to restrict the 
correspondents of a given labelled query node 
to only those nodes in the dictionary structure 
which carry the same label. 

Consider first the two graphs shown in Fig. 


** The path matrix may be generated automatically 
as a sum of powers of the corresponding connection 
matrix.13 
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a ) DICTIONARY STRUCTURE b) QUERY STRUCTURE 

Figure 9. Abstract Graphs Including Branch Labels. 


9. Clearly these graphs are identical with those 
shown in Fig. 1, except for the added branch 
labels which distinguish three types of relations 
denoted respectively by the digits 1, 2, and 3. 
The binary connection matrix of Fig. 2 may 
now be replaced by a numeric branch label 
matrix as shown in Fig. 10, whose i-j^*‘ element 
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Figure 10. Branch Label Matrix for Graphs of Figure 9. 


is n if there exists a branch of type n from node 
i to node j, and is 0 otherwise. In the set gen- 
eration and partitioning procedure, it is then 
possible to keep with each node a list of branch 
labels of all outgoing (or incoming) branches. 


Set 

Number 

Criterion for Set Generation 

Corresponding Sets 

I 

Outward degree 2 (or greater) 
and outward branch label (3,3) 

fa} c(2,9} 

11 

Outward degree i (or greater) 
and branch label (2) 

(b) c{ 1,2, 4,6,8) 

III 

Inward degree 2 and branch 
label (2,3) 

fc) c {2,7,8} 

IV 

Connections from set I includ- 
ing branch label (3) 

{b,c} c{4,7,8} 

V 

Connections from set 11 and 
branch label (2) 

{c}c{2,6,8.7} 

VI 

Connections into set III and 
branch label (2) 

{b}C{i.6,8,4} 

VII 

Connections into set III and 
branch label (3) 

{a} c {3,4, 2.9) 


Figure 11a. Set Correspondences for Graphs 
of Figure 9. 
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a 

b 

c 

1 

2 

3 

4 

5 

6 

7 

8 

9 


I 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 


II 

0 

1 

0 

1 

1 

0 

1 

0 

1 

0 

1 

0 

{a) £{2,9} 

II 

0 

0 

1 

0 

1 

0 

0 

0 

0 

1 

1 

0 

{b}C{4,8} 

V 

0 

1 

1 

0 

0 

0 

1 

0 

0 

1 

1 

0 

(c) c{7,8} 

V 

0 

0 

1 

0 

1 

0 

0 

0 

1 

1 

1 

0 


VI 

0 

1 

0 

1 

0 

0 

1 

0 

1 

0 

1 

0 


11 

1 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

1 



Figure 11b. Set Inclusion Matrix for Sets of Figure 11a. 


and, obviously, given a pair of corresponding 
sets not only must the nodes match as before, 
but the branch labels must match as well. The 
matching procedure is illustrated in Fig. 11. 


The branch labels make it possible to gen- 
erate a large number of sets at the outset. The 
set partitioning procedure illustrated by the set 
inclusion matrix of Fig. 11(b) then results in 
the formation of the three small sets reproduced 
in the figure. Assignment of (a) to either node 
@ or node ® finally produces two one-to-one 
mappings as follows : 


© 

© 




and j 


© 

© 


@ 

® 

© 


A comparison of the graphs of Fig. 9 can be 
used to verify that these two mappings are the 
only ones which obey the branch labelling re- 
strictions. 


As a last extension, consider now the two 
Syntol graphs^ of Fig. 12. These graphs cor- 
respond to an actual document abstract and to 
a search request, respectively, as encoded under 
the Syntol system, and may be seen to be iden- 
tical with the structures of Fig. 9 except for 
the added node labels. In order fully to repre- 
sent the system, it is no^w necessary to add node 
label matrices to the connection matrices and 


©AUGMENTATION 



Figure 12a. Typical Syntol Document Graph. 



Figure 12b. Typical Syntol Query Graph. 


to the branch label matrices. The node label 
matrices may be represented either as a table 
including all the node names together with the 
(possibly vacuous) corresponding labels, or 
alternatively as a full matrix whose ele- 
ment is 1, whenever label j is attached to node 
i. The node labels serve the same purpose as 
the branch labels, since they restrict the num- 
ber of possible correspondents of a given node 
to only those nodes which either carry the same 
label, or else carry no label, thus indicating that 
they can match any node whatsoever that satis- 
fies the remaining restrictions. 

The procedure used to determine whether the 
query graph of Fig. 12 is a subgraph of the 
document graph is outlined in Fig. 13. Since all 
nodes are labelled, an immediate correspond- 
ence is established between the nodes of the two 
graphs under consideration (sets I, II and III 
of Fig. 13). It remains to determine whether 
the connections and branch labels are preserved. 
Sets IV of Fig. 13 reveal an incompatibility, 
since node @ has two outgoing branches with 
a branch label (3), whereas the corresponding 
node ® has only one such outgoing branch. 
Since a set containing two elements cannot be 
contained in a set containing only one element. 


Set 

Number 

Criterion for Set Generation 

Corresponding Sets 

I 

Node label “cortex” 

{a} £{3} 

II 

Node label “nerf optique” 

{b)c{8} 

III 

Node label “chronaxie” 

{c}£{7) 

IV 

Direct connections from set I 
with branch label (3) 

{b.c)c(2} 


Figure 13. Set Correspondences for Graphs of 
Figure 11 Using Direct Connections. 
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Set 

Number 

Criterion for Set Generation 

Corresponding Sets 

I 

Node label “cortex” 

{a}c{3} 

11 

Node label “nerf optique” 

{b}C{8} 

III 

Node label “chronaxie” 

{c)c{7} 

IV 

Paths from I with branch label : 
(3) 

{b.c}c {2,7,8} 

V 

Paths from II with branch label 
(2) 

{c}c{7} 

VI 

Paths into III with branch label : 
(2) 1 

{b}c{i,2,4,6,8} 

VII 

Paths into III with branch label ■ 
(3) 

{a) c {2, 3, 4, 9} 

VIII 

Paths into II with branch label 
(3) 

(a) c (2, 3, 4, 9} 


Figure 14. Set Correspondences for Graphs of 
Figure 11 Using Indirect Connections. 

the subgraph test fails, and there is no need to 
proceed further. 

It is therefore necessary to relax the match- 
ing conditions by taking into account indirect 
connections and intermediate nodes. The path 
matrix of Fig 8 is now used to verify that com- 
plete paths (rather than direct connections) 
and path labels are preserved by the corre- 
spondence in the node labels. The set corre- 
spondences, shown in Fig. 14, demonstrate that 
to each outgoing and incoming labelled path in 
the query structure there corresponds a path 
with similar properties in the document struc- 
ture, The query graph therefore matches the 
document graph when indirect connections are 
taken into account. 

An adjustable procedure for the comparison 
of query and dictionary structures can now be 
outlined. The process uses the same matching 
algorithm throughout, and is modified only by 
altering the matrices which represent the con- 
nection patterns and the branch or node labels. 
The exact strategy used in the progressive 
alteration of the matrices may be made to de- 
pend on the type of document collection being 
processed, and on preliminary retrieval tests. 
Clearly, the weaker the restrictions which affect 
the matching process, the more matches are 
likely to be obtained, and the larger therefore 
the collection of answers to a given search re- 
quest. 

In general, elimination of the branch labels 
from the query and document graphs reduces 


a variety of possible relations between terms 
to a single one (represented by an unlabelled 
branch). Replacement of directed by non-di- 
rected branches further reduces the ability to 
discriminate between a variety of relations, 
since a relation from A to B is now equivalent 
to one from B to A. Finally, removal of node 
labels simplifies both the search requests and 
the document identifications, since it eliminates 
from consideration some of the terms used as 
identifiers. 

A possible strategy for the gradual broaden- 
ing of matching criteria is as follows : 

a. Use unmodified query structure Q and 
dictionary structure D and test whether 
Q is a subgraph of D ; 

b. If the preceding test is negative use path 
matrix including indirect connections to 
determine whether Q is a completed par- 
tial graph of D ; 

c. If the preceding test is again negative, 
selectively remove branch labels by alter- 
ing branch label matrix and test again 
using first only direct connections (sub- 
graph test), then indirect connections; 

d. If matching conditions must be further 
relaxed, replace unilateral by bilateral 
connections and use symmetric connec- 
tion matrices first with direct and then 
with indirect connections ; 

e. Finally, selectively remove node labels and 
test again for subgraph and then for in- 
complete partial graph. 

A retrieval system using graph matching 
procedures in conjunction with natural lan- 
guage data is outlined in the next section. 

A SENTENCE MATCHING PROCEDURE 
FOR DOCUMENT RETRIEVAL 

A simplified automatic document retrieval 
system is shown in Fig. This system 

makes use of the standard statistical proce- 
dures, including the computation of word fre- 
quency counts, word associations based on co- 
occurrence in the same sentences or texts, docu- 
ment associations based on co-occurrence of 
words, and document relevance coefficients.'*'^’^^ 
In addition, a dictionary or thesaurus may be 
used if available to normalize the vocabulary. 
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SIMPLIFIED SYSTEM USING STRUCTURAL MATCHING 
FIGURE 15 

Figure 15. Simplified System Using 
Structural Matching. 

The quantitative procedures may be supple- 
mented by choosing a set of significant sen- 
tences, as determined by the statistical process, 
and using them to perform a structural analysis. 
Specifically, each word is furnished with one or 
more thesaurus category numbers (the seman- 
tic labels) as a result of the dictionary look-up 
procedure. If no dictionary is available, each 
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word can of course function as a semantic label 
by itself. A syntactic analysis is then per- 
formed which determines a dependency struc- 
ture for the words of a sentence, and also gen- 
erates a syntactic label for each word. A typical 
dependency tree, resulting from an automatic 
syntactic analysis, is shown in Fig. 16. A syn- 
tactically analyzed sentence can of course be 
represented as before by direct and indirect 
connection matrices, as well as syntactic and 
semantic label matrices; moreover, these ma- 
trices can be generated automatically from the 
output furnished by the syntactic analysis pro- 
gram. 


It is now possible to compare the set of ana- 
lyzed sentences or search requests with a set of 
''criterion phrases’" included in a phrase dic- 
tionary. Each criterion phrase is representa- 
tive of one or more subject categories, and if 
a match is obtained between a criterion phrase 
and an analyzed sentence or search request, the 
corresponding subject categories can be at- 
tached to the matching sentences or requests. 
To retrieve a set of documents in answer to a 
given search request, it is then sufficient to 
compare the subject categories attached to the 
requests, w'ith the subject identifiers attached 
to the documents as outlined in Fig. 15. 


A typical criterion phrase is shown in Fig. 
17. Each criterion pHrase is represented, as 
before, by an identification number and control 



Figure 16. Typical Syntactic Dependency Tree. 
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THESAURUS 
CATEGORY 014: 


( INFORMATION, \ 
EXXJUMENTCS), 
FACT(S), I 
DATUM, DATA 
ETC. / 


THESAURUS 

CATEGORY 017 /RETRIEVAL, \ 
I PROCESSING, 
ORGANIZATION, 
\ SEARCH, ETC / 


Figure 17. Typical Criterion Phrase. 


information, the direct and indirect connection 
matrices, the syntactic and semantic node label 
matrices, and the category indicators which 
identify the subject classes for the given phrase. 
The semantic node labels attached to the sample 
criterion phrase are decoded in Fig, 17, 

The matching process between a given crite- 
rion phrase and a sample sentence or search 
request is identical with that used in the pre- 
ceding section for document graphs. That is, 
two principal criteria must be satisfied : 

1. Given a specified node of the criterion 
phrase, all those sentence nodes are se- 
lected which have matching syntactic and 
semantic labels ; 

2. From among those sentence nodes which 
obey the restriction of part 1, some subset 
must be chosen whose direct (or indirect) 
connection pattern is identical with the 
connection pattern of the corresponding 
nodes in the criterion phrase. 

Consider, as an example, the criterion phrase 
of Fig. 17 and the sentence of Fig. 16. Clearly, 
both syntactic and semantic labels of nodes @ 
and ® , and ^f nodes ® and @ will match 
properly. However, there exists a path from 
node a to node (B) in the criterion phrase, 
while no such path exists from node @ to node 
©. Therefore, the structure matching procedure 
will not be successful for the given example. 
On the other hand, it can be easily verified that 
the trees of Fig. 18 will, in fact, propel'ly match 
the criterion phrase of Fig. 17. 

Several methods are provided in the system 
for adjusting the matching process. First, the 
matching algorithm itself is adaptable, since 
node and branch labels can or cannot be taken 
into account, and direct as well as indirect con- 
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TREE STRUCTURES WHICH MATCH THE 
CRITERION PHRASE OF FIGURE 17 

FIGURE 18 

Figure 18. Tree Structures Which Match the Criterion 
Phrase of Figure 17. 


nections can be used. Second, it is possible to 
provide the criterion phrases with a smaller or 
larger number of syntactic and semantic labels, 
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thus restricting or enlarging the possible sen- 
tence nodes which are compatible. Finally, the 
thesaurus which can be used to replace text 
words by thesaurus categories, as well as the 
criterion phase dictionary can be enriched 
to ensure inclusion of a larger variety of pos- 
sible sentence structures. The system is pres- 
ently being tested in order to determine the 
practical effectiveness of these various meas- 
ures. 
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SOME IMPROVEMENTS IN THE TECHNOLOGY OF 
STRING MERGING AND INTERNAL SORTING 

Martin A, Goetz 
Applied Data Research, Inc, 

Princeton, N. J. 


GENERAL 

Sort/merge programs for magnetic tape com- 
puter systems are of two basic classes;^ 

Digital (or Radix) and 
Collation 

The digital sort is useful in only a limited 
number of cases and is not examined in this 
paper. 

The collation sort is the more general type 
of sort and is composed of two basic sub-pro- 
grams : 

1. The first sub-program internally sorts a 
group of data. The group of data after 
being sorted is referred to as a “string’’ or 
“initial string.” Such a sub-program is 
referred to as an internal sort. 

2. The second sub-program merges two or 
more strings. It will produce as output 
longer strings and will eventually produce 
one string which contains all the data. Such 
a sub-program is referred to as a “merge,” 
or “string merge.” 

The input data enters the internal sort sub- 
program only once and the merge sub-program 
one or more times. The various sorting sys- 
tems in use today all try to minimize the execu- 
tion time of the merge sub-program. This is 
accomplished by maximizing the way of the 
merge while at the same time keeping input 


strings the same size thus resulting in an 
“effective” power of the merge equal to the way 
of the merge.* The read-forward Oscillating 
Technique described in this paper is directed 
toward this goal. 

The Von-Neuman (T/2),^ Polyphase,^’ ^ and 
Cascade^ merge techniques begin after all the 
data is processed by the internal sort sub- 
program and the initial strings are placed on 
tape (Figure 1). The Von-Neuman Technique 
places the initial strings on only half the avail- 
able tapes; the Polyphase and Cascade place 
data on all the unused tapes. Depending on the 
number of tape units, one of the merging tech- 
niques will prove superior over the others. The 
read-backward Oscillating Technique® writes 
several initial strings, transfers to the merge 
sub-program, merges the strings and returns 
to the internal sort — Whence the name Oscillat- 
ing (Figure 2). As the number of tape units 
used for sorting increase, the effectiveness of 
the Oscillating Sort increases. Given T input 
tapes, the effective power of this technique is 
T-2. This technique, previously considered only 
for read-backward tape systems, is developed 
for read-forward only tape systems as described 
in the following pages. 


* For a distinction between “way of the m?rgc” and 
“effective power of the merge” please see glossary of 
sort /merge terms. 
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ExaiQ)I« - 6 tapes 

16 strings 



Figure 1. 

Distribution of Strings During Internal 
sort for Von Neuman (N/2), Cascade and Polyphase 
Merge. 

READ-FORWARD OSCILLATING 
MERGE 

The read-forward Oscillating Merge allows 
the use of a preceding internal sort technique 
which produces variable size string lengths 
(e.g., Replacement-Selection”). It also allows, 
of course, techniques which produce fixed-size 
strings (e.g., successive merging')- 

This paper presents an example in which the 
internal sort is a Replacement-Selection. The 
Replacement-Selection Technique used as an 
internal sort produces an initial string length 
almost twice the size of the memory available 
for sorting. Since the Oscillating Merge Tech- 
nique must merge initial strings formed by this 
technique, it is important that we review the 
nature of the output produced by the Replace- 
ment-Selection Technique. 

Given a memory for building strings that 
can hold R records, the expected string length 
for random data for the first string^ is 1.73R; 
for successive strings, 2.0R; without replace- 



Figure 2. 

Tape Motion of Read-Backward Oscillating Merge. 


ment. l.OOR. Given T tape units, the string on 
the first tape is estimated to be 1.73R; for the 
last tape unit, l.OOR, for all other tape units, 
2.00R. This variability in string length re- 
duces the advantages of the read-backward 
Oscillating Technique.® As will be demon- 
strated, this variability also reduces the effec- 
tiveness of the read-forward Oscillating Tech- 
nique. 

The read-forward Oscillating Technique can 
also be used with an internal sort that produces 
initial strings of a fixed size. In this case, the 
initial string length is l.OR for all tape units. 

THE TECHNIQUE 

It is more convenient to describe the tech- 
nique when the initial string lengths produced 
by the internal sort are fixed in length (l.OR), 
It will then be shown how the read-forward 
Oscillating Merge operates when used with the 
Replacement-Selection Technique in which 
variable length strings are produced. 

Given N tape units (Ti to Tn) available as 
work tapes. (In the example shown in Figure 
3, N is equal to 5.) 
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Tape Motion of Read-Forward Ossillating Merge. 


1. Initial strings are written on all tape units 
(Ti to Tx). After each string is written 
on successive tape units, the tape is re- 
wound. The last tape unit is not rewound 
(Figure 3A). 

2. Tape units Ti to Tx-i are merged onto tape 
units Tx, at which time all tape units are 
rewound (Figure 3B). 

3. As soon as tape unit T, is rewound (this 
tape unit contains only one string of data), 
a string is written on tape unit Ti and then 
it is rewound. Strings are consecutively 
written on tape units T 2 to Tx-i. All tape 
units except Tx-i are immediately rewound 
(Figure 3C) . Tx-i is not rewound. 

4. A merge of Ti to Tx -2 and Tx is performed 
onto Tx-j. Then all tapes are rewound 
(Figure 3D). 

5. The cyclings shown above continue for N 
cycles at which time all tape units contain 
a string length of size (N — 1) times the 
initial string length (Figure 3E) . 

6. The (N — 1) size strings are merged, the 
power being (N — 1) as shown in Figure 
3F. 

7. The pattern shown continues until the 
data is exhausted, at which time one 
string will be formed, similar to the read- 
backward Oscillating Technique. 

Note that after 16 strings were merged, the 
rewind interlock time was equal to rewind time 
for N — 1 strings. This time is a relatively small 
portion of the rewind time. Note that rewind 
time is minimized by starting the development 
of the next string in a group, although it will 
not be merged until the next cycle (String 21 
and Strings 17-20). 

When using the Replacement-Selection Tech- 
nique in conjunction with the read-forward 
Oscillating Technique, initial strings are con- 
strained not to exceed twice the number of 
records (R) in memory. After 2.0R records 
are written on tape, the string is completed. 
Note that with other merge techniques the 
minimum string length might be l.OR as a 
lower limit and “all” records to be sorted (an 
entire file) as the upper limit. When using the 
read-forward Oscillating Technique, the upper 
limit is set at 2.0R. When 2.0R records can 
not be formed, dummy records (and blocks) 
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are substituted to form 2.0R records. Since 
2.0R is the expected length, excessive dummys 
will not be formed. 2.0R may not be an opti- 
mum figure. It will, however, closely approxi- 
mate the optimum as the number of tape units 
increase. 

To the degree that the read-forward selection 
technique must generate dummy records to 
produce string lengths of 2.0R, it is less ef- 
ficient than the read-backward technique. As 
stated above, however, this reduction in effi- 
ciency should be slight under most circum- 
stances. 

THE PROBLEM OF WRITING ON TAPE 
USING THE READ-FORWARD OSCILLAT- 
ING MERGE 

The read-forward Oscillating Merge herein 
proposed requires that data be written on the 
front of a tape without destroying information 
further down the tape which will be subse- 
quently read. Depending on the computer sys- 
tem and Hie tape units, this may cause problems. 
For some computer systems, the read head may 
not be positioned properly due to start-stop 
time variations, automatic bypassing of un- 
writable tape (bad spots), effects of the erase 
head or differences in writing density. For 
systems with a tape rewrite feature, there 
would be no problem. For systems which allow 
tape erase, gaps on the tape can be program 
generated which will solve the problem. If 
neither of these features are available “hash” 
blocks can be inserted to protect information 
which must be subsequently read. The tech- 
nique may not be applicable to older systems 
where there is no programmed error control. 

Because of the wide variety of tape systems, 
this problem is not covered in more detail. It 
has been investigated, and it can be shown that 
the additional programming to cope with this 
problem is trivial. 

CONCLUSION 

The technque described offers the same 
potential as the Tead-backward Oscillating 
Technique, namely: as the number of itapes 
increase, this technique will perform the sort- 
ing task almost twice as efficiently as the N/2 
(Von Neuman) and more efficiently than the 


Polyphase or Cascade Merge Technique. The 
Read-Forward Oscillating Technique might be 
considered even in systems which allow back- 
ward reading. This is particularly the case in 
systems where tape reverse interlock is high. 

The foregoing presentation of the read-for- 
ward technique should not be construed as a 
recommendation of this technique to the ex- 
clusion of other methods. The selection of 
proper sorting techniques is a complex problem 
dealt with in detail in other papers *■ *** and 
which is not completely formalized at this time. 

VARIABLE LENGTH RECORD SORTING 
USING THE REPLACEMENT-SELECTION 
TECHNIQUE 

INTRODUCTION 

Another area where efficiencies in sorting 
can be attained, is in the internal sort. Since 
the number of merging passes is based on the 
number of internal strings, it is desirable to 
minimize the number of strings formed by the 
internal sort. The number of strings are mini- 
mized when the amount of data sorted at one 
time (the length of the string) is maximized. 

The sorting techniques in use today limit the 
string length for variable size records to the 
number of records that can be stored in 
memory. This section describes a sorting 
technique that permits an initial string to be 
formed that contains approximately twice as 
much data as can fit into the working storage 
available in memory.'' 

GENERAL 

The variable-record length internal sort seg- 
ment uses a modified version of the Replace- 
ment-Selection Technique.® This technique has 
previously been applied to the sorting of fixed- 
size records or to a variable-size record con- 
verted into a fixed format. 

The proposed technique temporarily “dis- 
jdms” a variable-aize record into one or more 
fixed-size “pieces” Crefemed te w ^^gmeaits) 
and at selection time comMnes the separate 
segments of the record. No expansion of the 
record occurs. 
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The selection of an “optimum” fixed-size seg- 
ment storage area is either determined by the 
user (based on his knowledge of the data) or 
assigned by the computer program. The opti- 
mum size is one which will produce the longest 
string on tape without causing average size 
records to be broken up into a large number of 
segments. Long strings are produced by select- 
ing a fixed storage segment size into which the 
records will fit without the need for excessive 
“fill” when the last “segment” of the record is 
moved into the fixed segment area. The selected 
segment size should be such that all keys for 
each record appear in the first disjointed seg- 
ment when subdivided. 

A brief review of the selection logic for con- 
ventional fixed size Replacement-Selection in- 
ternal sorting follows : 

Core memory is divided into 4 parts. 

(1) Instructions 

(2) Input Areas 

(3) Output Areas 

(4) A string building area (work storage 
area) 

The string building area is further sub- 
divided into pockets (or slots) equal to a fixed 
size plus about 12 characters to hold informa- 
tion used during the selection process. The 
selection process is similar to a tournament 
match in which the selected record at each level 
is referred to as a “winner.” Assume as an 
example, the string building area can contain 
12 records. The logic for selection is as fol- 
lows: 

1. A string building area is filled with records 

(R) from the input area. (See Figure 4.) 

2. The first record in the string building area 

is compared against the second, the third 


INPUT 

AREA 


RECORD 

1 

RECORD 

2 

RECORD 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 


OUTPUT 

AREA 


Figure 4. 

Example of Records in String Building Area. 


against the 4th, etc. In this manner a 
set of first round winners is selected. The 
addresses of the winners are stored. 

3. In a similar way succeeding rounds of 
winners are selected until one final winner 
is selected. Addresses of the winners of 
each round are recorded. This process is 
called initialization of the tree and re- 
quires R-1 comparisons. (See Figure 5.) 



'Figure 5. 

Logical Relationship of Records in Tournament. 

4. The final winner is moved to the output and 
its position in the string building area is 
replaced with a new record from the in- 
put area. 

5. If the new input record could be put out in 
sequence with respect to the last record 
that was put out, it can participate in the 
current tournament. Otherwise, it may not 
participate in the current tournament. 

6. Each subsequent record selected requires 
LogoR comparisons. This selection process 
is referred to as a “scan.” 

7. When no records can participate, the cur- 
rent tournament is over and a new string 
must be formed. 

USING THE REPLACEMENT-SELECTION 
TECHNIQUE FOR VARIABLE LENGTH 
RECORDS 

A record in the input area is divided so that 
one or more segments of it may be moved into 
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slots in the string building area. The first such 
segment, containing the key, is called the 
header; all other segments are trailers. Each 
segment of a record, except the last, contains 
as part of its indicative information, the ad- 
dress of the segment of the record which is its 
immediate successor. 

When the string building area is completely 
filled it is “initialized.” Only headers partici- 
pate in this initialization and only headers are 
winners in each “round” of the tournament. 
Initialization is concluded when a final winner 
has been determined. 

At this point, the winner is moved to the out- 
put area. Immediately, its place in the string 
building area is filled from the input area. 
Whenever a “segment” is moved from the 
string-building area, that slot is immediately 
filled. This is true irrespective of whether a 
header or trailer segment was moved out (Fig- 
ure 6). As segments are moved out they are 
rejoined in the output area into a variable 
length record. Any “fill” in the last segment is 
deleted at this time. 

After a string building area slot has been 
refilled, a “scan” takes place if one is neces- 
sary. It is necessary to “scan” if either a 


FIGURE 6 

RECORD SEGMENTS IN 
STRING BUILDING AREA 



Figure 6. 

Rsoorl Segments in String Building Area. 


header was moved out of the string building 
area or if a header was moved into the string 
building area. In other words, the only time a 
scan is not performed is when a trailer is re- 
placed by a trailer. 

After the scan, the next “segment” is moved 
into the output area according to the follow- 
ing rules : 

1. If the last segment moved to the output 
contained a reference to a successor (which 
must be a trailer), the successor is moved. 
This accomplishes the result of assembling 
the records which were segmented ini- 
tially. 

2. If the last segment moved was without a 
successor, the current final winner of the 
tournament (which must be a header), is 
moved to the output. 

CONCLUSION 

The logic for processing variable-length 
records using the Replacement-Selection Tech- 
nique requires an additional address in the 
tournament tree which is used to chain between 
segments of a variable-length record. Addi- 
tional logic is required to process variable 
length records. Records are segmented into 
header and trailers. After the selection process, 
they are recombined into a variable length 
record. The logic of the Replacement-Selection 
Technique requires modifications as described. 
As in the Replacement-Selection Technique for 
fixed size records, data is moved only twice. 
The overall logic of the Replacement-Selection 
Technique is retained and all its advantages 
are exploited.® 
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GLOSSARY OF SORTING AND MERGING 
TERMS USED WITHIN THIS PAPER* 

Backward read 

A feature available on some magnetic tape 
system whereby the magnetic tape units can 
transfer data to computer storage while mov- 
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ing in a reverse direction. Normally used, if 
available, during the external sort phases to 
reduce rewind time. 

Balanced sorting 
See: (T/2) -way merging. 

Cascade merging 

A technique used in a sort program to merge 
strings of sequenced data. Given T work tapes, 
merging is performed at T-1 on part of the 
data, T-2 on parts of the data, and so on. Strings 
of sequenced data are distributed in a Fibonacci 
Series on the work tapes preceding each merge. 
The effective power of the merge varies be- 
tween T-1 and T-2 but in all cases is less than 
the power of the Polyphase Merge. CF: effec- 
tive power of the merge. 

Collating 

Sequencing a group of records by comparing 
the key of one record with another record 
until equality, greater than, or less than is 
determined. 

Collating sequence 

The sorting sequence; a description of the 
sort key for a file of records. 

Collating sorting 

A sort which uses a technique of continuous 
merging of data until one sequence is developed. 

Computer limited 
See : process limited. 

Digital sorting 

A sort which uses a technique similar to 
sorting on tabulation machines (e.g., IBM 
Sorter). The elapsed time is directly propor- 
tional to the number of characters in the se- 
quencing key and the volume of data. Also 
“radix sort.” 

Effective power of the merge 

Equal to S|^, where S is the number of input 
strings and N is the average number of times 
each element of data is read. 

Fibonacci series 

A series where the current number is equal 
to the sum of the two preceding numbers : i.e.. 


* Source: A Glossary of Sorting and Merging Terms 
— Communications of ACM, May 1963. 


1, 2, 3, 5, 8, and so on. Some sort programs 
distribute strings of data onto work tapes so 
that the number of strings on successive tapes 
form a Fibonacci series. 

Fixed size records 

Denumerable file elements each of which has 
the same number of words, characters, bits, 
fields, etc. Cf : variable-length records. 

Generalized sort 

A sort program which will accept the in- 
troduction of parameters at run time and which 
does not generate a program. 

Generated sort 

A production program which was produced 
by a sort generator. 

Input tape(s) 

Tape(s) containing a file in arbitrary se- 
sequence to be introduced into a sort/merge 
program. 

Insertion method 
See: Sifting. 

Item 

See: Record. 

Key 

Also, sequencing key; criteria; sequencing 
criteria. The fields in a record which deter- 
mine, or are used as a basis for determining, 
the sequence of records in a file. 

Magnetic tape sorting 

A sort program that utilizes magnetic tapes 
for auxiliary storage during a sort. 

Major key 

The most significant key in a record. 

Merge 

A program that performs merging. 

Merging 

The forming of a single file of sequenced 
records from two or more files of sequenced 
records. 

Multifile sorting 

The automatic sequencing of more than one 
file, based upon separate parameters for each 
file, without operator intervention. 



606 PROCEEDINGS— SPRING JOINT COMPUTER CONFERENCE, 1964 


Multipass sort 

A sort program which is designed to sort 
more data than can be contained within the 
internal memory of a central computer. In- 
termediate storage, such as disc, tape, drum, 
etc. is required. 

Optimum merging patterns 

The determination of the sequence in which 
specific sorted tapes in a file should be proc- 
essed so as to minimize the total number of 
merge passes required to create a single file of 
sequenced records. 

Order of the merge 

The number of input files to a merge pro- 
gram. Also : power of the merge. 

Oscillating merge 

A technique used in a sort program to merge 
strings of sequenced data. For tape systems 
that permit backward reading, the effective 
power of the merge is equal to T-2. 

Output tape(s) 

Tapes containing a file in specified sequence 
as a result of a specific sort/merge process. 

Pass 

The processing of each file record once for 
the purpose of reducing the number of strings 
of sequenced records and increasing the number 
of sequenced records per string. 

Phase 

An arbitrary segmentation of a. sort program. 
Many sorts are segmented into .three phases: 
initialization phase, internal ^ phase, merge 
phase. 

Polyphase merging 

A technique used in a sort program to merge 
strings of sequenced data. Given T work tapes, 
merging is performed at the power of T-1. 
The effective power of the merge varies be- 
tween T-1 and T-2 depending cn the amount 
of input data and the number of strings. 

Power of the merge 

Also: way of the merge; order of the 
merge, the number of inputs to a merge pro- 
gram. Cf ; effective power of the merge. 


Process limited 

Also : computer limited. A sort program 
in which the execution time of the internal 
instructions determines the elapsed time re- 
quired to sort. Cf : tape limited. 

Radix-sort 

See : Digital sorting. 

Record 

The basic element of a file such that the sort- 
ing of file constitutes the re-ordering of file 
records ; also referred to as ‘"item,” 

Replacement-selection technique 

A technique used in the internal portion of a 
sort program. The results of the comparisons 
between groups of records are stored for later 
use. A selected record is placed on the output 
tape and a new record replaces the selected 
record. Given N records, a record is selected 
with 1 + lag^N tests ; the expected string length 
for random data is 2N records. 

Rewind time 

Elapsed time consumed by a sort/merge 
program for restoring intermediate and final 
tape files to original position. 

Scratch tape(s) 

See : Work tapes. 

Sequence break 

That point in a file between the end of one 
string and start of another. 

Sequencing criteria 
See: Key. 

Sequencing key 
See: Key. 

Sifting 

A method of internal sorting where records 
are moved to permit the insertion of records; 
also called ‘‘insertion method.” 

Sort 

The copying of a file of records into a cor- 
responding file in a specified sequence. 

Sort, external 

The second phase of a multipass sort pro- 
gram, wherein strings of data are continually 
merged until one string of sequenced data is 
formed. Cf : string merge. 
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Sort, internal 

The sequencing of two or more records with- 
in the centra] computer memory ; the first phase 
of a multipass sort program. 

Sort generator 

A program which generates a sort program 
for production running. 

String 

A group of sequenced records, normally 
stored in auxiliary computer storage; i.e., disc, 
tape or drum. 

String merge 

Program that performs merging. 

String merging 

The forming of a single string from two or 
more strings of sequenced records. 

(T/2)-way merging 

A technique used in a sort program to merge 
strings of sequenced data. The power of the 
merge is equal to T/2. 

Tape limited 

Also: I/O limited. A sort program in 
which the effective transfer rate of tape units 
determines the elapsed time required to sort. 
Cf : process limited. 

Tennis match sorting 
See: Replacement-Selection Technique. 

Tournament sorting 
See: Replacement-Selection Technique. 

Variable-length records 
Denumerable file elements for which the 
number of words, characters, bits, fields, etc. is 
not constant. Cf : fixed-size records. 

Von Neuman sort 
See: (T/2) -way merging. 

Way of the merge 
See : power of the merge. 


Work tape(s) 

Also: scratch tapes. Tape(s) used to store 
intermediate pass data during a sort program. 

Xmas tree sorting 

See: Replacement-Selection Technique. 
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INFORMATION REQUIREMENTS 
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INTRODUCTION 

For years, we who are interested in data 
processing have had a vague notion that one of 
the problems facing managers today is the lack 
of information. “If only I had known . . is a 
familiar phrase to all of us. Most of us would 
like to cause that phrase to become unfamiliar. 
Unfortunately, very few people, if any, have 
been able to state very explicitly how we should 
go about filling the information void. Progress 
in developing a methodology for designing man- 
agement information systems has been slow. 

So many people have written and said so 
much about management information systems 
that I would like to be sure that all of us have a 
similar notion in mind. Therefore, I would like 
to briefly define a management information sys- 
tem as — a collection of procedures, equipment, 
and persons associated together for the purpose 
of providing managers, who have the authority 
to make decisions that commit the firm or its 
resources, with descriptions of the elements 
relevant to the performance of their function. 
In other words, a management information sys- 
tem is a means of providing to the people who 
“need” it, information to guide them in the con- 
duct of the business. An ideal management in- 
formation system, then, would do at least these 
things : 

1. Provide each level and position of man- 
agement with all the information that can 
be used in the conduct of each manager's 
job. 


2. Filter the information so that each level 
and position of management actually re- 
ceives only the information it can and 
must act on. 

3. Provide information to the manager only 
when action is possible and appropriate. 

4. Provide any form of analysis, data, or in- 
formation whenever it is requested. 

5. Always provide information that is up to 
date. 

6. Provide information in a form that is 
easily understood and digested by the 
manager. 

Whether or not you can fully agree with this 
as a description of ^ managerial utopia, I hope 
that this will at least provide a frame of refer- 
ence that will help you to understand the use of 
conceptual models. 

PROGRESS IN MANAGEMENT 
INFORMATION SYSTEMS 

Using this ideal as a standard toward which 
our efforts are directed, we must recognize that 
no one has an ideal management information 
system. Just because we have not yet reached 
our ideal, however, we should not be discour- 
aged from attempting to make progress. It is 
not easy to produce a monumental improvement 
in the science of management. Nevertheless, 
considerable progress has been made. In fact, 
progress has been striking in the fields of hard- 
ware, software, mathematical techniques, and 
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the integration of procedural systems. Com- 
puters are available that can produce informa- 
tion at fantastic rates. High-speed printers can 
generate reports fast enough to inundate an en- 
tire committee. Teleprocessing has been devel- 
oped to the point where up-to-date information 
can be maintained in a central data file. All of 
Uiese various types of hardware can and will be 
improved, and all of them will probably be made 
less expensive or more efficient. However, I be- 
Keve that management is not utilizing the capa- 
Jtxility that is already available. 

Software has been developed so that special 
requests or changes in procedures need not 
totally upset a system. I am not beating the 
drums for FACT, COBOL, or any other pro- 
gramming language ; I am sure that all of these 
will be improved further. However, I believe 
that even software is available today to assist 
management more effectively than managers 
realize. 

Mathematical techniques have been developed 
for many things. We can optimize inventories, 
we can model markets, and we can predict the 
outcome of an election when only a small per- 
centage of the returns have been reported. Even 
so, this resource of mathematical techniques is 
neither well-understood nor fully utilized by 
managers. 

Many systems and procedures people have 
made substantial progress in developing inte- 
grated data processing systems. These systems 
chew up customers’ orders at one end and spew 
out bills of lading, invoices, and production 
orders from the other end. However, their pri- 
mary focus of attention has been on the routine 
operating documents of the business. They have 
made a substantial contribution in that they 
have permitted many managerial tasks to be 
directed by management instead of being sub- 
ject to the individual judgments of many oper- 
ating people. In inventory control, for instance, 
stock clerks are no longer responsible for in- 
ventory levels ; management has the key to the 
inventory control system and can adjust its 
mathematical judgment to management’s will. 

The point of all this is that in spite of tre- 
mendous progress on a countless number of 
fronts, managers are not truly helped. Gener- 
ally speaking, managers are working with the 


same variety of reports that they had several 
years ago. We believe that a major area of sys- 
tems design has not been given sufficient atten- 
tion. The problem of developing and defining 
the proper content of an information system 
has been slighted in the general work of systems 
analysis and design. One of the reasons for its 
having been slighted is that it typically falls in 
a no man’s land between the technician and the 
manager. The technician typically says to the 
manager, ‘‘All you need to do is tell me what you 
want, and I have the wherewitha4 to supply 
it.” Whereupon the manager, out of despera- 
tion, lack of foresight, or overconfidence, usu- 
ally supplies the pat answer; “Just what I am 
getting now only quicker and more current.” 
Some managers will honestly say: “I don’t 
know, but you’re the systems expert; can’t you 
tell me?” 

This no man’s land has created considerable 
difficulty, and very few people have risen to the 
challenge to try to do something about it. 

Another reason for little having been done 
about the ability to define the content of an in- 
formation system is that content is extremely 
difficult td work with. 

Liet me define content as the message or infor- 
mation that is contained in a communication or 
a record. Every report, analysis, or document 
has some meaning (or message) that transcends 
the actual format of the document or report. 
When we are concerned with defining the con- 
tent of a management information system, our 
concern is to determine the subject matter of 
the messages that managers should receive. 
When I talk about content, I am referring to the 
subject matter of reports and documents, re- 
gardless of how the data is displayed or arrayed. 

Information content is difficult to determine 
for managers. It is almost impossible to sepa- 
rate the content of managerial information from 
the field of organization theory. Organization 
theory and practice is thin ice; it is a subject 
that is emotionally charged for any manager in 
a real-life situation. It is also a subject on 
which highly qualified, reasonable men can be 
expected to disagree. This is an extremely diffi- 
cult area for anyone to work in, and particu- 
larly for a person who is scientifically and ana- 
lytically inclined. You cannot determine the 
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information that a manager needs without con- 
sidering his responsibility and authority. You 
must concern yourself with what he is, in the- 
ory, held responsible for, and how he discharges 
his responsibility and delegates his authority. 

Being concerned with the content of an infor- 
mation system forces us to be concerned with 
‘‘how to manage well.’’ We must concern our- 
selves with: How does a manager operate? 
How does he reach his decisions? How does he 
make his decisions effective? How does he man- 
age?” And perhaps an even more appropriate 
question is : “How should he manage?” No one 
has an adequate description or an adequate set 
of principles to tell us how to manage well. At 
best, there are a thousand platitudes that are 
collectively exhaustive and mutually contradic- 
tory. As if the lack of knowledge and under- 
standing of the subject (from any analytical 
point of view) were not bad enough, this is also 
a subject that managers have difficulty discuss- 
ing rationally and on which respected authori- 
ties disagree. 

In spite of the difficulty, we believe that this 
is one area in which substantial progress will 
be made in the next decade. If nothing else, 
managers and systems men will be forced to it 
by the availability of hardware, etc., and the 
fear that some competitor may do it. 

We believe that much of this difficulty is men- 
tal, and that we can attempt to develop a method 
for determining information requirements by 
trying to make the job mentally easier. There- 
fore, what we have to suggest is not so earth- 
shaking, but we believe it is a sound approach 
than can enable a poor, mere, mortal mind to 
somehow get around the subject of management 
and get into the business of defining informa- 
tion requirements. The approach that we have 
to suggest might be described as a research ap- 
proach. It is an approach that should enable 
analytically-inclined people to develop a defini- 
tion of their company’s information needs. If 
they want to, they can complete the entire job in 
an ivory tower, but the job will be done better 
if they have frequent reference to the regular, 
operating facts. The analysts should not lose 
touch with reality, but in fact, we have used 
this technique when there has been no reality 
to get in touch with. We have developed infor- 


mation requirements for nonexistent firms, and 
we believe that the results were extremely satis- 
factory. 

Of course, an approach that is frequently pro- 
ductive, but is not the research approach, is 
that of expertise. Most information systems 
and most informational improvements that are 
made today are made on the basis of expertise. 
Someone writes an article in a professional 
journal or a scholastic business review describ- 
ing the types of reports that they use. Managers 
read these reviews, think that they are wonder- 
ful and try to apply them to their business. The 
formats of the reports are face-lifted, and the 
manager tries to use them. Sometimes he has 
great success, and sometimes no success at all. 

There are many varieties of models, and they 
can be used in many different ways. Operations 
research people are accustomed to applying 
mathematical models to business problems. 
Some chemists and biochemists use physical 
models of what they imagine the structure of 
atoms to be. For now, we are interested in deal- 
ing with a conceptual model. It is a model that 
deals with words and imagery to enable us to 
focus our attention and communicate our im- 
pression about the operations and the manage- 
ment of an enterprise. 

MODELS OF OPERATIONS 

The first step in developing a conceptual 
model of an enterprise is to attempt to state the 
key operations that the enterprise must accom- 
plish in order to continue to function. We might 
describe an operation as a “gross unit of work 
specialization that is essential to the functioning 
of the enterprise.” The easiest way to deter- 
mine the appropriate operations for a concern 
is simply to begin to list all operations. Once the 
initial top-of-the-head list has been compiled, it 
should be juggled, combined, expanded and or- 
ganized until it consists of a number — probably 
between 10 and 20 — of operations of approxi- 
mately equal importance. 

As an example, I have chosen a wholesaling 
business. We might think of this as being a 
typical wholesaling business rather than any 
one specific wholesaler. Figure 1 is an initial 
list of potential operations for a hardware, 
drug, or appliance wholesaler. Many of these 
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captions appear to be steps in a procedure, and 
they should. Most business operations follow 
a routine. On the other hand, very few of the 
captions suggest a department or organiza- 
tional unit of a wholesaler. We believe that the 
major concern of management is the basic op- 


STOCK CHECKING 

MAINTAIN PRODUCT LINE 

ORDERING 

TELEPHONE SEaiNG 

SCHEIXJLING 

SALESMAN SELLING 

RECEIVING 

ADVERTISE AND PROMOTE 

STOCKING - BULK 

WRITE ORDERS 

HOLDING 

PICKING 

STOCKING - SHElf 

CHECKING 

OK RETURNS 

DaiVERY 

PICKUP REIURNS 

9IV0ICE 

CREDIT RETURNS 

Bia 

STORE RETURNS 

COLLECT 

SHIP RETURNS TO MFR. 

MAINTAW ACCOUNTS 

RESTOCK RETURNS 



Figure 1. Potential Wholesaling Operations. 



Figure 2. Operations of a ^‘Typical” Wholesaler. 


erations of the business. If management loses 
sight of this and becomes preoccupied with peo- 
ple, the business can become a very nice place 
to work, but an extremely disorganized mass of 
human relations. We want to concentrate on 
the basic operations. 

Figure 2 shows the operations that I finally 
selected as being the important ones for a typi- 
cal wholesaler. They are arranged in a flow- 
chart format. The reasons for this will be ex- 
plained later. Each block represents a job, task, 
function, mission, or as we have chosen to call 
them, operation of the firm. Take any one of 
these blocks out of the diagram and the business 
either ceases to exist, or is changed drastically 
in its methods of operation. 

In effect, this diagram of the operations of a 
wholesaler is a macroscopic view of the busi- 
ness. Businessmen are already acquainted with 
other models of their business. They are accus- 
tomed to an organization chart and they are 
familiar with accounting statements. All of 
these are macroscopic views of the business. 

The view of the firm that is illustrated in 
Figure 2 is specifically designed to focus our 
attention on the important things that must be 
managed. All of these things must be under the 
control and the close scrutiny of management. 
Each of these operations can be accomplished in 
a variety of ways. Each of these operations is 
a positive reason for spending money, not just 
an unavoidable expense — we have not included 
the payment of taxes or the negotiation with 
labor unions, nor, in fact, have we included the 
information system itself. 

The next step is to try to provide meaning 
to each of the names that has been put in a box. 
in our flowchart. Because these names mean 
different things to different people, it may ap- 
pear that we have omitted a significant opera- 
tion, or we might have difficulty agreeing that a 
certain activity is included in one operation 
rather than another. Before we go any further, 
we should provide a more careful delineation of 
each operation. This delineation should take 
two forms. The first form is a simple statement 
of the input and the output for each operation. 
Inputs and outputs are, in effect, the fences be- 
tween the operations. They serve not only to 
help define the operations themselves, but also 
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to make us certain that we have not omitted 
some significant activities between the opera- 
tions, Figure 3 shows the wholesaler's opera- 
tions with their inputs and outputs. Notice that 
the only original input is product ideas and the 
only final outputs are: goods with customers, 
payment, cash, and collections. Having all the 
operations and all the inputs and outputs to- 
gether in a single flowchart is a help, but it is 
still just a body of names which do not yet have 
sufficient meaning to enable us to probe the 
process of management. 

The second form of delineation for operations 
is a description of the suboperations that are 
contained within the major operation. Figure 4 
is a statement of the suboperations that are re- 
quired to transform the demand for specific 
products, customers ready to buy, and prefer- 
ence for our services into orders. These lists of 
suboperations can easily appear to be a descrip- 



Figure 3. Wholesale Operations with Inputs 
and Outputs. 


OPERATION DESCRIPTION 
4. GET ORDERS 


ORDERS SHOULD BE SECURED FROM RECOGNIZED CUSTOftrtERS. THESE 
CUSTOMERS SHOULD BE GIVEN ANY APPROPRIATE ORDERING AIDS SUCH 
AS WANT BOOKS, ORDER BLANKS, PREPAID ENVELOPES, ETC. FOR FOR 
CUSTOMERS IN REMOTE CITIES, LEASED TELEPHONE LINES WITH LOCAL 
NUMBERS MAY BE PROVIDED TO ENABLE CUSTOMERS TO PLACE ORDERS 
WITHOUT PAYING LONG-DISTANCE TOLL RATES, OR CUSTOMERS MAY BE 
ENCOURAGED TO CALL COLLECT, 

THE CUSTOMER'S INVENTORY MAY BE REVIEWED FOR HIM BY A REPRE- 
SENTATIVE OF THE WHOLESALER (POSSIBLY THE SALESMAN). THE 
WHOLESALER MAY MAINTAIN A PERPETUAL INVENTORY RECORD FOR THE 
WHOLESALER. AUTOMATIC DECISION RULES MIGHT BE PROVIDED AND 
REVIEWED BY THE WHOLESALER. 

A TELEPHONE CALLING SERVICE MAY BE ESTABLISHED TO SOLICIT ORDERS 
FROM CUSTOMERS. THIS INCLUDES: (1) ESTABLISHING AND SUPERVISING 
A TELEPHONE SALES STAFF; (2) SELECTING THE CUSTOlWERS TO BE CALLED 
AND ESTABLISHING A SCHEDULE FOR THE CALLS THAT IS GEARED WITH 
OTHER OPERATIONS OF THE WHOLESALER: (3) CALLING CUSTOMERS; (4) 
SUGGESTING ITEMS AND QUANTITIES TO BE ORDERED; (5) QUOTING PRICES; 
AND (6) PREPARING ORDER DOCUMENT. 

ORDERS MAY BE SOLICITED DIRECTLY BY SALESMEN WHO PREPARE THE 
ORDER DOCUMENT, GET THE CUSTOMER'S APPROVAl, AND SEND IT TO THE 
WAREHOUSE. 

"TURNOVER" ORDERS MAY BE SECURED BY ENCOURAGING CUSTOMERS TO 
SPECIFY OUR NAME TO MANUFACTURER'S SALESMEN. ALSO WORK WITH 
MANUFACTURER'S SALESMEN TO INCREASE THE PROPORTION OF THEIR 
ORDERS THAT ARE TURNED OVER TO US. 


Figure 4. Description of an Operation, 


tion of a general operating procedure. They are 
likely to include some of the things that we 
jotted down as potential operations in the very 
beginning. (See Figure 1.) If we were dealing 
with a specific firm, the description would be 
more detailed. 

This (Figures 3 and 4) completes a concep- 
tual model of the firm. Some of my associates 
and I have gone through this exercise for many 
firms and some command and control situations. 
We have found in every case that when two, 
three or four people sit down to prepare this 
conceptual model of an activity, they can, by 
constant negotiation give and take, agree upon 
a set of operations and definitions of operations. 
In short, a number of people with different back- 
grounds can follow this procedure to produce a 
single, well-defined, comprehensive view of the 
activities of a company. 


MODEL OF MANAGEMENT ACTIONS 

Now that we have a conceptual model of what 
the firm, as a whole, does, we would like to 
move on to a conceptual model of the functions 
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of management. We have an adequate state- 
ment of what the company does, but we must 
now decide how management manipulates the 
things that the company does in order to make 
it successful or unsuccessful. The basic ques- 
tion can be simply stated as: '‘How are the 
operations managed?'' 

Before getting too deeply involved in the con- 
ceptual model of management actions and their 
results, let's spend a little time poking into the 
lore of management. Many books have been 
written, from Frederick Taylor until the pres- 
ent time, about how managers can and should 
operate. The business reviews of our leading 
universities constantly publish articles about 
how to manage. We commend these sources to 
your attention. From our study of these sources, 
we Have generalized and concluded that man- 
agement must evaluate, organize, select, decide, 
train, and motivate. We can also recognize that 
management has at its disposal a number of 
resources. Resources can be summarized into 
the "four M’s" — money, machines, manpower, 
and materials. Somehow, citing these names 
(evaluate, etc.) for the things that managers 
do, and citing the names for the resources that 
management manages seems to be helpful, but 
it cannot be the final conclusion. These names 
and labels are not sufficiently specific to help us 
decide what information management needs in 
order to manage effectively. 

In addition to the lore of management, we can 
logically consider the things that a manager 
does in a typical work day. Those who are man- 
agers, and those who are familiar with the gen- 
eral operation of managers, can recognize that 
many of the things that a manager does do not 
have long-range significance for the company 
as a whole. For instance, a large part of the 
working day for a manager is spent communi- 
cating with those about him. This communicat- 
ing is, of itself, not truly significant. It is an 
unavoidable expense. The decisions that may 
result from those communications, or the eval- 
uations that can be made as a result of them, 
are significant, but the communications them- 
selves are not. Similarly, a good manager 
spends a large part of his time studying and 
reading reports. In reality, this is simply an- 
other form of communication. The amount of 
time that a manager spends actually making 


policy and making key decisions is a relatively 
small proportion of his total time. However, 
we believe that these are the significant things 
that a manager does that we would like to assist. 
With a good information system we might be 
able to reduce the amount of time that a man- 
ager must spend communicating and reading 
reports. But more significantly, we would like 
to assist him in making wise decisions about the 
truly important facets of the operations that he 
is managing. 

One way to help cut out some of the chaff is 
to recognize that we are attempting to deline- 
ate those important managerial actions that are 
taken by management as a whole. We are not 
trying to find out the specific actions that any 
one manager takes. If tve were, we would miss 
the significant managerial actions taken by 
committees. In many cases, a managerial action 
is taken at different levels of the organization. 
At each level, the manager has a different set of 
limitations within which he makes his decision. 
If we try to cope with all of these variations at 
once, we will be swamped. Our immediate con- 
cern is for the information that is required by 
management as a whole. We would like to post- 
pone until much later the actual job of deciding 
exactly who should receive which information. 

After many trials and errors, and consider- 
able study, we have concluded that the most 
significant managerial actions can usually be 
stated as "selecting a course of action," "adjust- 
ing a rate of expenditure (or level of effort)," 
or "allocating resources." In a sense, allocation 
of resources is simply a combination or special 
case of selecting and adjusting. 

To illustrate, a manager selects a course of 
action when he decides to use a particular chan- 
nel of distribution, or decides to acquire a par- 
ticular piece of production machinery. In 
general, these are discrete choices ; the manager 
must select one or more out of a number of 
alternatives. 

On the other hand, a number of elements can 
be adjusted over a continuous spectrum, such as 
prices or market area. These things can be 
adjusted up or down over a broad range. 

Allocation consists of assigning particular 
resources to particular activities. The word 
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“allocation"' suggests that the amount of the 
resource is limited in some way. The grandest 
sort of allocation is the allocation of money to 
each of the operations conducted by the firm. 
The general management of the firm must con- 
stantly reallocate its money among such activi- 
ties as selling, delivery, and inventory. 

Keeping in mind these general sorts of man- 
agerial actions, we can return to the conceptual 
model. We must examine each operation to 
determine the significant managerial actions 
that govern the quality of performance of each 
operation. If management acts wisely, the op- 
erations will be performed well. A management 
information system can not supply good judg- 
ment, but it can supply a sound base of facts 
to which managerial judgment can be applied. 
Figure 5 shows the managerial actions for the 
operation “Get Orders." Discovering these 
managerial actions for any operation is one of 
the most creative and imaginative steps in the 
process of constructing a conceptual model of 



Figure 5. Managerial Actions for an Operation. 


a firm. It takes time and it takes, stargazing. 
It is an iterative process thatf earn he improved 
each time it is reviewed. It is also an extremely 
educational process. If the managers them- 
selves can participate in the process, they can 
probably profit by it. 

There are a few sources that we can look to 
for assistance in pointing out the key mana- 
gerial actions. One of these is the detailed 
description of the operation. (See Figure 4.) 
We can review that description, looking for 
instances^ in which a manager must select from 
a number of alternatives, or for key decisions 
that are built into the regular conduct of the 
operation. We can also consider the resources 
that are required to perform the operation. It 
might pay to construct a list of the resources 
that are used in each operation. Resources 
might be ; 

Partieular skills 

Manual labor 

Existing facilities (physical capabilities) 

Known suppliers 

Existing public (customer) image 

Existing products 

Figure 6 lists the major resources that are used 
to get orders. Some of these resources are sub- 
ject to quantity manipulation. However, the 
rough proportions of the various resources are 
dictated by the nature of the operation itself. 
For instance, for a wholesaler, the selling ac- 
tivity cannot effectively use a large fixed capital 
investment ; almost the sole resource for selling 
is the highly skilled ability of a salesman in per- 
sonal contact with the customer. 

In considering the resources required to per- 
form an operation, there is a potential trap. 
That trap consists of considering money as a 
resource. No one can deny that money is a 
resource, but it is the one ultimate resource. 
Given sufficient time, it can be transformed into 

. SALESMAN’S TIME 
. TELEPHONE SALES CLERK' S TIIVC 
. FAVOR OF MANUFACTURERS 
. TELEPHONE FACILITIES 

Figure 6. Resources Used in Getting Orders. 
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any of the other resources. Therefore, in deter- 
mining the resources that are used in the per- 
formance of an operation, we should exclude 
money from our consideration. Otherwise, we 
run the risk of doing a superficial job. 

Another potential source of help in discover- 
ing managerial actions are job descriptions, or- 
ganization charts, financial statements, and 
interviews with managers. All of these aids 
should be used liberally. 

Next we would like to consider the results 
of each managerial action. Usually, at least one 
result of every managerial action is obvious 
from the statement of the action itself. If the 
action selects or adjusts, one result of the action 
is a commitment to a course of action or a 
change in the level of something. However, we 
are interested not only in the direct effects of 
the action itself, but also in the ancillary effects. 
Almost every managerial action involves more 
than one result. Many managerial actions imply 
a trade-off between two potential results. Some 
managerial actions simply have more than one 
effect. Figure 7 shows two managerial actions 
and their results. In total, when taken for all 
managerial actions, these form a conceptual 
model of a management of the firm. 



Figure 7. Action-Result Models. 


We slipped into using the word “result'' 
rather quickly. We might better call them ele- 
ments of performance, or parameters of per- 
formance. These are the factors or elements in 
the business that are influenced by the mana- 
gerial actions. It is important to think of the 
relationship between the action and its results 
as an influence. If you try to think of it as too 
direct a cause-effect relationship, you are likely 
to get bogged down. For instance, if you try to 
think of the purchase of particular delivery 
vehicles as directly causing the cost of delivery, 
you will get into trouble because delivery costs 
are also affected by wage scales paid to drivers, 
the distances the trucks are driven, and in fact, 
the number of deliveries that are made. Each 
of these things influences the cost of delivery, 
but none of them controls it. Similarly, in Fig- 
ure 7, many of the results are influenced by both 
actions, and if we added the action “adjust fre- 
quency of telephone calls" it would influence 
many of these results also. 

At this stage of the development of the con- 
ceptual model, we must be careful not to insert 
results that are too far-fetched. Moreover, we 
must recognize that some factors are influenced 
directly by a managerial action, and some are 
influenced only indirectly. For instance, almost 
all managerial actions have an influence on 
profit. Similarly, a number of managerial ac- 
tions influence sales volume. To include these 
as results will be helpful for only a few mana- 
gerial actions. We should concentrate on direct 
results. For instance, some managerial actions, 
such as “select products to sell," may directly 
affect the size of the market in which the firm 
competes. Others, such as “adjust advertising 
expenditures," may directly influence the share 
of the market that the firm enjoys. These ac- 
tions have an indirect or derived effect on sales 
volume. 

The results of managerial actions do not ex- 
ist in a void. They influence one another also. 
For instance, to continue with the last example, 
sales volume is influenced directly by the size 
of the market and the penetration of the 
market. With diligence and care, we can deter- 
mine which results are influenced by which 
other results, and, in total, we can develop a 
structure of results. Figure 8 represents a por- 
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Figure 8. Structure of Results 
(Partial and Simplified). 


tion of such a structure. It is drawn in flow- 
chart form, and it has a natural progression 
from left to right. Observe that sales area is 
the most causal, basic element of performance, 
and return on investment is the most final, com- 
mon element. 

The preparation of this structure of results 
will assist in simplifying the statements of re- 
sults of each managerial action. From the 
structure, we can infer that any one result has 
a chain of influences. Since we have the struc- 
ture, it would be redundant to repeat the chain 
for each managerial action. It is enough to note 
the left-most element in the chain as a result of 
an action. For instance. Figure 7 does not show 
number of salesmen as a result. 

Furthermore, the very exercise of trying to 
compile a complete structure of all results of 
managerial actions is likely to point out some 
results that have not been linked to any action. 
If the result stands at the beginning of a chain, 
we should try to find the managerial action that 
influences it. 

Figure 9 shows the same structure of results 
as does Figure 8, but superimposed on it are the 
managerial actions that influence the results. 
This exercise can help us to understand how the 
results of one action can influence another ac- 
tion. It can also help us to see the managerial 
actions that are influenced by specific factors. 
For instance, the action ‘'adjust working hours 
of salesmen” is affected by “salesmen’s travel 
time” and “call time.” 


The flowchart is a very cumbersome device 
to display a complex structure of results. Fig- 
ures 8 and 9 are simple only because they depict 
so few factors. We might try to simplify the 
job by using a precedence matrix such as Fig- 
ure 10. A primary advantage of a matrix form 
of documentation is that it permits us to say 
something about the nature of the relationship 
between an action and its results, and between 
various results. Some of these relationships are 
clearly defined. After all, some of them are 
taken almost directly from accounting practice, 
and are, therefore, susceptible to the accounting 
definitions. We know that some other relation- 
ships are proportional, even though we may not 
know what the exact proportion is. The inter- 



Figure 9. Managerial Actions Superimposed 
on the Structure of Results. 
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sections of the matrix can contain all that i^e 
know about the nature of the relationship. If 
we could determine the exact mathematical 
function that relates each of the actions to its 
results and the results one to another, we would 
have a fabulous mathematical model of the firm. 
Unfortunately, the nature of many of the rela- 
tionships is simply unknown. 

This exercise completes the conceptual model 
of the firm, its management actions, and the 
results of those actions. The model can be used 
as a general guide to understanding how the 
firm works. It might be used as the basis for a 
mathematical model of the firm. 

DETERMINING INFORMATION CONTENT 

The major purpose of creating the model in 
the first place was to assist in determining the 
information that is needed by the management 
of the company to manage the company well. 
This can be done by simply reviewing the 
action/result models. (See Figure 7.) We can 
consider each element of the model as a require- 
ment for managerial information. We would 
like to measure the managerial actions them- 
selves — how much action is taken, when was it 
taken, etc. — and we would like to measure each 
of the results of the action. A comprehensive 
information system will contain each of these 
measurements. In addition, it will contain many 
similar measurements of competitors' business 
practices. 

In order to be more specific, we need to re- 
turn to the diagrams of actions and results. One 
of the actions in Figure 7 is “adjust frequency 
of salesmen's order solicitation." This auto- 
matically suggests the question, “How often do 
salesmen solicit orders?" The simplest answer 
to that question is the total number of calls 
made by all salesmen in a month. Of course, 
we might want a finer breakdown — number of 
calls made by each salesman, and number of 
calls made on each customer, A tally of the 
number of calls is a frequency from the firm's 
point of view, but we might want to turn it 
around and look at it from the customers' point 
of view. How many solicitations does the aver- 
age customer receive in a month ? 

Call frequency is not hard to measure. In 
fact, if we tried a little bit, we might even be 


able to learn how frequently our competitors 
solicit orders. If we asked our customers who 
else they buy from, and how frequently each 
competitor's salesman calls, we can expect some 
customers to refuse to answer, and some cus- 
tomers to give us wrong answers. But if we 
carefully compile the data that we do get, we 
can expect jto be better informed than we would 
have been" otherwise. 

If we dwell on the subject longer, we might 
think of some other significant measures of the 
action itself, but we should also be concerned 
with the results of the action. “Salesmen’s 
travel time" would be fairly easy to measure. 
All we need to do is ask the salesmen to keep 
track, for a month, of the time of day at which 
they leave one account and arrive at another 
account. We might even ask the salesmen to 
take an hour or so and prepare a “typical" 
itinerary with an estimate of the travel time 
between accounts. Either way, this is not an 
onerous chore, and it might even be worthwhile 
for the salesmen to go through the exercise just 
for what they would learn from it. Then we will 
need to compile it to learn the travel time. 

“Salesmen's travel expenses" are regularly 
measured by most firms. We should observe in 
passing that there is a close connection between 
travel time and travel expenses. Furthermore, 
both factors, time and expenses, cannot be 
attributed to individual customers. Any at- 
tempt to determine the amount of travel time 
or travel expense that is incurred on behalf of 
any one customer is bound to be arbitrary. 
Neither of these factors lend themselves to 
interpretation from the customers' point of 
view. 

If possible, we would like to go beyond meas- 
uring the action and its results. We would also 
like to consider the characteristics of the rela- 
tionship between them. For these particular 
actions and results, we have a pretty good no- 
tion of the basic relationship — as the calling 
frequency is increased, the travel time and ex- 
pense increase also. Any information beyond 
this intuitive feel will be difficult to acquire. We 
might ask a few salesmen to play a game with 
us and prepare hypothetical itineraries for the 
manner in which they would cover their terri- 
tories if they were to cut their number of calls 
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to one-third of their present frequency. Then 
do it again for two-thirds, three-halves, and 
double. A compilation of these estimates should 
give us pretty good information about the rela- 
tionship between the action, “adjust frequency 
of salesmen’s order solicitation,” and the re- 
sults ,“salesmen’s travel time” and “salesmen’s 
travel expenses.” 

Another important characteristic of informa- 
tion is that it must be related in time, and in 
many cases, it must be understood “through” 
time. Each action and each result must be 
thought of as a time series. We want more in- 
formation than just the present status. We also 
want to know how frequently we called on cus- 
tomers last year and the year before; and we 
want to know what frequency is planned or 
expected in the future. In addition, we want 
comparable information about travel time and 
expenses. If we can get nothing better, we 
might even use an historical comparison to tell 
us about the relationship between call frequency 
and travel time and expense. 

The process of defining information require- 
ments — the content of an information system — 
IS to find a w'ay to measure each managerial 
action, each result, and each connection between 
an action and a result. Then see if a comparable 
measurement can be found for competitors. We 
must be certain that the information can give 
an historical perspective and a glimpse of the 
future. In many cases, this method will lead us 
to unexpected information requirements. 

How many sales managers do you know who 
could give you a satisfactory, quantitative an- 
swer to the question, “How often do salesmen 
solicit orders?” Most management information 
systems pass up this information completely, 
and yet, if we have any faith in ou^' model, we 
can see that the action that is measured by the 
answer to that question has a far-reaching effect 
upon salesmen’s time and expenses, and upon 
sales volume. 

The job of translating these information re- 
quirements into reports and files is no small 
job, but it is a more familiar one. Systems and 
procedures people have been doing this sort of 
thing for years. Anyway, we have not yet 
found a way to have conceptual models help 
with this part of the job. 


CONSTRUCTING CONCEPTUAL 
MODELS 

The procedure for developing a complete con- 
ceptual model is easy to work with mentally. It 
progresses from one stage to the next, and at 
each stage we can focus our attention on only 
a few factors at a time. In the early stages, 
these factors are abstract. They are so abstract 
that they can apply with little modification to a 
number of different economic enterprises. But, 
as the early framework is expanded and com- 
pleted with more details, the conceptual model 
begins to apply only to the economic enterprise 
for which it is designed. 

I would not mean to imply by these words 
that conceptual models are easy to develop. It 
is one of the most rigorous mental exercises that 
I have run into. To complete a model requires 
creativity, imagination, insight, and judgment. 
I firmly believe that no one person can construct 
a good conceptual model. The best way is to de- 
velop one through individual effort which is 
followed up with a review by one or more per- 
sons. If this review is not available, the next 
best alternative is to attempt to complete sev- 
eral stages of development of a model. Then, 
put it away in a desk drawer and come back to it 
in six months. By this time, you may be a dif- 
ferent enough person to review your own work 
adequately. 

Don’t get fooled by all the flow charts and 
geometric shapes. They are not the conceptual 
model. The model exists in the mind. The lines, 
words, and shapes are only a means of com- 
municating and permanently recording what 
the mind has conceived. 

Recall that we are dealing with a model, and 
a model is something that simplifies reality. The 
model does not faithfully reproduce every attri- 
bute and characteristic of the original; if it 
did, it would be a duplicate not a model. A 
wind tunnel model attempts to reproduce the 
exterior shape of an airplane or flying object 
so that engineers can observe the performance 
of the shape in moving air. A mathematical 
model for inventory control does not reproduce 
all the characteristics of the real world; it re- 
produces only those characteristics which are 
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felt to be of primary importance in controlling 
inventory. This same sort of attention must be 
applied to conceptual models for determining 
information requirements. The developer must 
continually weed out and separate trivial de- 
tails from important generalities. For instance, 
back in Figure 7, we might have shown ‘‘num- 
ber of salesmen's direct orders" as an element 
of performance, but we cannot find anything 
significant about that number. 


At each stage of the development of a model, 
the analyst should ask himself : “Is each of the 
elements or factors which I have written on this 
page of approximately equal importance ?" 
Since there is no absolute scale of importance, 
this question cannot be answered conclusively. 
That is why our model is a conceptual one. It 
deals with words, abstractions, and impressions. 
As such, it is subject to arbitrariness and judg- 
ment. Even so, it is worth developing. 
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